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(54) Title: OLFACTORY RECEPTOR SEQUENCES 



(57) Abstract: The present invention provides polynucleotide sequences which encode polypeptides involved in olfactory sensa- 
tion. The present invention also provides the polypeptides encoded by these polynucleotide sequences, vectors comprising these 
polynucleotide sequences and host cells transfected with these polynucleotide sequences. The present invention further provides 
for functional variants and homologues of these polynucleotide sequences and the polypeptides encoded by these polynucleotides. 
Libraries of polypeptides are also provided. Also included in the present invention is the use of these polypeptides and libraries of 
Q polypeptides in screening odorant molecules to determine the correspondence (scent representation, scent fingerprint or scent pro- 
file) between individual odorant receptors (the polypeptides) and particular odorant molecules. Also encompassed by the present 
^ invention is the use of the scent representation, scent fingerprint or scent profile to re-create and edit scents. 
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OLFACTORY RECEPTOR SEQUENCES 

CROSS-REFERENCE TO RELATED APPLICATIONS 

This application claims priority benefit of United States Provisional Patent 
Application Serial No. 60/158,615, filed on October 8, 1999, and United States Provisional 
Patent Application Serial No. 60/1 84,809, filed on February 24, 2000. The contents of 
those applications are hereby incorporated by reference herein in their entirety. 

STATEMENT OF RIGHTS TO INVENTIONS MADE UNDER 
FEDERALLY SPONSORED RESEARCH 

Not applicable. 

TECHNICAL FIELD 

The present invention is in the field of human olfactory receptors and their use in 
screening for olfactory agonists and antagonists. The present invention pertains to isolated 
nucleotide sequences which encode human olfactory receptors and also to the proteins 
encoded by said nucleotide sequences. The present invention also encompasses vectors 
comprising the nucleotide sequences of the invention and further, host cells transfected 
with said vectors. The present invention also allows for the determination of primary 
scents and the identification of the odor receptors which are encoded to detect these 
primary scents as well as the determination of secondary scents and the identification of 
combinations of odor receptors which are encoded to detect such secondary scents. 
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BACKGROUND ART 

Our sense of smell plays an important role not only in our appreciation of our 
surroundings such as the smell of flowers or new mown grass, but also evolved as a survival 
skill. Numerous odorant molecules can be detected at extremely low concentrations, 
providing early warning of danger, such as the smell of smoke or contaminated food. Indeed, i 
potent example of this is that most pregnant women experience a heightened sense of smell, 
presumably to protect the fetus from the deleterious effects of food poisoning. 

It is estimated that humans can detect millions of different molecular species; however, 
our nose can discriminate only a fraction of these different chemicals (Mombaerts Curr. Opin. 
Genet. Dev. 1999 9, 3 15-320), usually estimated at about 1 0,000 odorants (Axel, Scientific 
American 1995, October, 154-159). Odorants for terrestrial species such as humans, are 
volatile (air born) ligands which are detected by the olfactory system. Odorants have vastly 
different chemical structures and subtle differences can lead to pronounced changes in the 
perceived odor (Mombaerts, supra). For instance, when the hydroxyl group of octanol is 
replaced by a carboxyl group to give octanoic acid, its perceived odor changes from orange and 
rose-like to rancid and sweaty (Malnic et ai, Cell 1999 96, 713-723). The basis for these feats 
of sensory perception are just beginning to be understood at a cellular and molecular level. 

The olfactory system contains millions of olfactory sensory neurons (OSNs) located in 
the olfactory epithelium of the nasal cavity. In humans, the olfactory epithelium occupies an 
area of approximately 5 cm 2 . The OSNs are bipolar with one end extending through the 
supporting cell into the mucosal layer, terminating in hairlike cilia. These cilia are the site of 
the olfactory receptors (OR) where the odorant ligands are thought to bind (Mombaerts Curr. 
Opin. Genet. Dev. 1 999 9, 3 1 5-320, Hildebrand et ai, Annu. Rev. NeuroscL, 1 997, 20, 595- 
63 1). The OSNs also have a single unbranched axon which leads to the olfactory bulb, a part 
of the brain containing approximately 2000 glomeruli where the axons terminate and initial 
processing of the sensory code takes place. OSNs expressing the same OR are randomly 
interspersed throughout the olfactory epithelium, but in both the nose and the bulb, information 
derived from different ORs is strictly segregated; each OSN in the nose and each glomerulus in 
the olfactory bulb appear to be dedicated to input from one or few OR type(s) (Malnic et ai, 
Cell 1999 96, 713-723). It also appears that the location of the glomeruli are conserved across 
individuals of a species, providing the first spatial processing of particular odorant patterns 
(Mombaerts Curr. Opin. Genet. Dev. 1999 9, 315-320). The domains in the olfactory bulb for 
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different odors may overlap, but the overall patterns are distinct (Hildebrand et al, supra), 
therefore it should be possible to identify and reproduce the characteristic pattern of a g.ven 
odorant. Output neurons project from the olfactory bulb to the primary olfactory cortex and 
from there to the higher cortical areas of the brain and to the limbic system (Malmc et al, 
5 supra; Hildebrand et al, supra, 20, 595-631). 

Until the identification of a large family of genes encoding putative odorant receptors 
(Buck&Axeia// 1991 65, 175-187), progress towards understanding the process of odor 
recognition was negligible. In recent years there has been an explosion in this field as more 
and more putative odor receptors an, isolated and cloned. The odorant receptor gene products 
10 have thus far been characterized through homology as seven transmembrane doma.n G protein- 
coupled receptors (GPCR). It is estimated that there are probably 500-750 OR-.ike sequences 
in humans, while there are 500-1000 OR genes in rat and mouse (Mombaerts Curr. Opm. 
Genet. Dev. 1999 9, 315-320). In mice, OR-like sequences make up approximately 1% of the.r 
genome, the largest known family in the mammalian genome, surpassing the complexity of 
15 even the immunoglobin and T-cell antigen receptor gene families (Mombaerts, supra). The OR 
are concentrated on the surface of the OSN's mucus coated cilia and it is thought that odorant 
m olecules bind to the OR in the olfactory epithelium and thereby initiate signal transduct.on. 
Current interpretation of recent experimental evidence favors the idea that each neuron 
expresses only one, or very few, ORs. Since mammals can detect at least 10,000 odors and 
20 there are approximately 1,000 or fewer ORs, each of the ORs must respond to several odorant 
molecules, and each odorant molecule must bind to several receptors. It is believed that 
various receptors respond to discrete parts of an odorant molecule's structure and that an 
odorant consists of several chemical groups each of which bind a characteristic receptor (Axel 
Scientific American 1995, October, 154-159; Malnic et al, Cell 1999 96, 713-723). 
25 The main signal transduction pathway mediated by OR homologues in vertebrate 

species involves G protein-mediated stimulation of adenylyl cyclase activity, resulting .n 
cAMP elevation that opens cyclic-nucleotide gated channels with a non-specific cat.on 
selectivity (Mombaerts Curr. Opi, Genet. Dev. 1999 9, 315-320). However, there are still 
numerous unanswered questions and recently it has come to light that 38-76% of the human 
30 gene OR sequences that are being reported may be pseudogenes and therefore incapable of 

expressing the proteins that encode the olfactory receptors. Some of the incidences may be due 
to the method of extracting the genomic DNA libraries (Mombaerts, supra). Few pseudogenes 
have been found in other vertebrates and their incidence in libraries from testicular DNA ,s also 
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rare (Hildebrand et al, Annu. Rev. Neurosci., 1 997, 20, 595-63 1 ). cDN A should not contain 
pseudogenes. There are a number of examples of ORs which have been successfully 
expressed and reactions to certain odorant ligands have been determined (Malnic et al, Cell 
1999 96, 713-723; Mombaerts, supra; Zhao et al., Science 1998 279, 237-242). 
5 Some attempts to express the ORs in heterologous cell lines resulted in the formation of 

inclusion bodies rather than the insertion of the proteins into the membrane (Kiefer etal., 
infra). However, purification of the receptors after expression in E. coli and their insertion into 
lipid vesicles facilitates the use of these receptors in odorant ligand screening using a 
combination of photoaffinity labeling and Trp fluorescence (Kiefer et al, Biochemistry 1996 
10 35, 1 6077-1 6084). In addition, a functional human OR receptor protein has been expressed in 
HEK-293 cells and oocytes and found to interact with odorant ligands (Wetzel et al, J. 
Neurosci. 1999 19, 7426-7433). There have also been, a number of successful efforts of 
expressing cDNA in insect Sf9 cells using baculovirus vectors (Mombaerts Annu. Rev. 
Neuorsci. 1999) as well as assays with neuronal tissue (Malnic et al., Cell 1999 96, 713-723; 
15 Zhao et al., 1998; Firestein et al., WO 98/50081). In addition, recent work accomplished the 
expression of chimeric mouse olfactory receptor sequences in HEK-293 cells and showed their 
reactivity towards a panel of odorant ligands, some at micromolar concentrations (Krautwurst 
et al., Cell 1998 95 91 7-926). The drawback to expression in heterologous cell systems is the 
lack of working signal transduction pathways which can be used to detect responses to odorant 
20 ligands; these drawbacks can be overcome with methods known in the art (e. g. U.S. Pat. No. 
5,798, 275). There are also methods of expressing and assaying functional neuronal receptors 
in neuronal cells, including methods for detecting particular odorant ligand specificity (Malnic 
et al, supra; Zhao, supra; Firestein et al, supra). 

25 Other publications of interest are: Chemical Senses 6: 343-349 (1981); Proc. Natl. 

Acad. Sci. USA 79: 670-674 (1982); Proc. Natl. Acad. Sci. USA 81(6): 1859-1863 (1984); 
Nature 316: 255-258 (1985); Brain Research 368: 329-338 (1986); J. Biol. Chem. 261: 
1299-1305 (1986); Proc. Natl. Acad. Sci. USA 83(13): 4947-4951 (1986); J. Neurosci. 6: 
2146-2154 (1986); J. Neurochem. 47: 1527-1533 (1986); Chemical Senses 13: 191-204 

30 (1988); Biochem. J. 260:121-126 (1989); J. Biol Chem. 264: 6780-6785 (1989); Biochim. 
Biophys. Acta 1013: 68-72 (1989); J. Biol. Chem. 264: 18803-18807 (1989); Biochemistry 
29: 7433-7440 (1990); FEBS left. 270: 24-29 (1990); Chemical Senses 15: 529-536 (1990); 
Eur. J. Biochem. 196: 51-58 (1991); Nature 349: 790-793 (1991); Neurosci. Lett. 141: 1 15- 
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118 (1992); Developmental Brain Res. 73: 7-16 (1993); Proc. Natl. Acad. Sci., USA 90: 
3715-3719 (1993); Human Mol. Genetics 3: 229-235 (1994); Eur. J. Biochem. 225: 1157- 
1 168 (1994); European Journal of Biochemistry 238: 28-37 (1996); Receptors and 
Channels 4: 141-147 (1996); Genomics 37(2): 147-160 (1996); Protein Science 8: 969-977 

(1999) ; Genomics 53: 56-68 (1998); Genomics 61:24-36 (1999); Genomics 63: 227-245 

(2000) - Trends in Neurosci. 7:35-36 (1984); Ann. Rev. Neurosci. 9:329-355 (1986); Trends 
Biochem. Sci. 12:63-66 (1987); Nature 351: 275-276 (1991); Nature 353: 799-800 (1991); 
Current Biol. 3(10): 668-674 (1993); Nature 372:321-322 (1994); Essays in Biochenustry. 
33- 93-104 (1998); and Nature, 398 (6725): 285-287 (1999). 

However, despite the forgoing, there has been relatively little work with human 
olfactory receptors, in particular in determining the sequences of large numbers of receptors, 
and less progress in determining the correspondence between particular human olfactory 
receptors and the scent(s) to which they respond. 

All publications cited herein are hereby incorporated by reference in their entirety. 



mS m oqttrF OF THE INVENTION 



20 



An object of the invention is to determine the correspondence between ORs and the 
scent(s) to which they respond. Once this is accomplished, scents can be both analyzed and re- 
created for enhancing human experiences or eliciting particular responses. The present 
invention pertains to isolated polynucleotide sequences encoding polypeptides mvolved m 
olfactory sensation. The present invention also pertains to the proteins encoded by sa,d 
nucleotide sequences. The present invention also encompasses vectors compnsmg the 
25 nucleotide sequences of the invention and further, host cells transfected with said vectors. 
The present invention also allows for the determination of primary scents and the 
identification of the odor receptors which are encoded to detect these primary scents as well 
as the determination of receptor complex scent components and the identification of 
combinations of odor receptors which are encoded to detect such receptor complex scent 
30 components scents. 
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The invention provides isolated polynucleotide sequences encoding polypeptides 
involved in olfactory sensation that are isolated from human olfactory epithelial tissue. The 
invention further provides expression vectors containing such nucleotide sequences. Also 
provided by the invention are purified polypeptides encoded by the nucleotide sequences. The 

5 invention further provides transformed cells which comprise a suitable host cell transfected 
with a suitable expression vector containing the nucleotide sequence encoding the receptor. 
The present invention also encompasses nucleotide sequences isolated from human olfactory 
epithelial tissue which encode receptors capable of binding odorant molecules. The invention 
further provides expression vectors containing such nucleotide sequences and homologues of 

1 0 both the polynucleotides and polypeptides. Further, the invention provides a means of using 
the nucleotide sequences of the invention in a method of screening odorant ligands to determine 
the specific binding of odorant molecules to a particular receptors, and further, determining the 
component odorant molecules of subjectively experienced smells, determining the combination 
odorant molecules and receptor stimulation or inhibition to re-create a particular scent. The 

1 5 binding of odorant molecules by the receptors encompassed in the present invention includes 
binding resulting in both the agonism (excitation/activation) and antagonism 
(inhibition/blocking) of receptor function(s) upon binding of the molecule. 

Accordingly, the invention includes an isolated polynucleotide comprising a sequence 
encoding a polypeptide which is involved in olfactory sensation. The OR polypeptides 

20 encoded are found within the sequences depicted in polynucleotide sequences SEQ ID NO: 1 
through SEQ ID NO: 73 and SEQ ID NO: 1 1 1 through SEQ ID NO: 1 52, or a nucleotide 
sequence at least 95% homologous to said sequences. The invention also encompasses the 
translation products of those sequences. The invention further comprises expression vectors 
comprising said sequences, host cells containing such expression vectors and/or expressing the 

25 polypeptide encoded therein, or phage displaying the polypeptide encoded by the sequences. 
The use of functional fragments of receptors is also encompassed by the invention. 
Preparations of receptors, further including biological or synthetic molecules which maintain 
the stability and functional structure of the receptors, are also included in the invention. The 
invention further encompasses fragments of said polynucleotides which can be used as probes 

30 or primers to identify additional polynucleotide sequences through techniques known in the art, 
including those fragments depicted in SEQ ID NOs: 74-1 05. 

The invention also includes additional isolated polynucleotide comprising a sequence 
encoding a polypeptide which is involved in olfactory sensation. The OR polypeptides 
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encoded are found within the sequences depicted in polynucleotide sequences SEQ ID NO:153 
through SEQ ID NO: 1084, or a nucleotide sequence at least 95% homologous to said 
sequences. The invention also comprises the translation products of those sequences. The 
invention further comprises expression vectors comprising said sequences, host cells containing 

5 such expression vectors and/or expressing the polypeptide encoded therein, or phage displaying 
the polypeptide encoded by the sequences. The use of functional fragments of receptors is also 
encompassed by the invention. Preparations of receptors, further including biological or 
synthetic molecules which maintain the stability and functional structure of the receptors, are 
also included in the invention. 

10 The invention also encompasses an isolated and purified olfactory receptor 

polypeptide scomprising the sequence of SEQ ID NO: 1085 through SEQ ID NO: 2008, or 
a polypeptide sequence that is at least about 95% homologous to a polypeptide sequence of 
the group consisting of SEQ ID NO: 1085 through SEQ ID NO: 2008 and having olfactory 
receptor function. Host cells expressing such polypeptides and phages displaying such 

1 5 polypeptides are also encompassed by the invention. The use of functional fragments of 
receptors is also encompassed by the invention. Preparations of receptors, further including 
biological or synthetic molecules which maintain the stability and functional structure of the 
receptors, are also included in the invention. 

Scents can be captured, analyzed and recorded by a sensory device using various 

20 methods. Scent capture can be initiated by the user or by an automatic sensing system. A scent 
can be analyzed in terms of its interaction with olfactory neurons of a mammalian, preferably 
human, olfactory system, or by the expression of individual receptors under appropriate 
conditions and appropriate assay conditions in multiwell plates or in terms of its perception by 
a panel of mammalian, preferably human, subjects. The interaction with olfactory neurons can 

25 be determined experimentally, in vitro, by determining the interaction of an odorant with 

olfactory receptors of a given type. Alternatively, the interaction with olfactory receptor can be 
determined using a computer simulation which provides information regarding the interaction 
of an odorant with the olfactory receptors. A panel of subjects can be used to represent odors 
in terms of their perception. The data so generated can be used to represent a scent in a manner 

30 which can be recorded in digital or other format, stored in media such as computer memory, 
disks, or printed format, and transmitted over a data network. The representation of the scent 
can be used to re-create the scent at a local or remote site using an emitter module. The 



7 



10 



PCT/US00/27582 

WO 01/27158 

. i . VoKIr asnects are attenuated or eliminated, 
enhanced or added and undesirable aspects are <u 

' Accorfing.,, the — also embraces libraries of o.factory receptors suttable 
for a— - — pattern of a compositton with meters, — *e 

73 SEQ1DNO:... through SEQ ID NO: 1 52, and SEQ ID NO: 53 through ^ 
3084 where the polynucleotides encode functional olfactory receptors; or fcncttona. 

are also encompassed by the invention. 

A* encotnpassed by the invention are libraries of olfactory receptors snrtable for 
determhung the interaction pattern of a composition with the receptors, compnsmg a, en* 
"peptides of SEO ID NO: I0S5 through SEQ ID NO: 2003, where dte poises 
I 'JZ olf-ory receptors; or functional friars of the po,ypep, ; de, Ltbranes of 
at leas. 50, .00, 200, or 500 receptors are also encompassed by fte tnvenfon. 

Tbe invenoon a,so embraces methods for defining me binding pattern of a 
composition with olfactory receptors, invo.ving exposing the composition to an olfacror^ 
"«*ary,andde— g Werner me composirion binds » each « recep.r, 
teeby determining the overall binding patter of me composition. In addmonal 

0 which the composition, or me various chemicals within the composttton btnd 30 ft. 

^Igfteabso^amoun.ofacdvaUon.oramoun.ofacrivadon^ve^ofter 

or a contto, substance. Tbe composttion can consis, essentially of one compound 
orchemical.orcancompriseatlcast.wocompoundsorchcm.ca., 

The invention also embraces DNA arrays or DNA chips compnsmg the DNA 
segments derived from any combination of, or each of, SEQ ID NO: .53 through SEQ ID 
J,!. V* tnvention also embraces a method of determining differences among one or 
more individuals with respect ,0 their olfactory faculties, comprising the steps of 
comparing the olfactory DNA of each individual against the array or ch.p. 
30 The invention also embraces a method to determine single nucleotide 

^morphisms in o.factory receptors, comprising the steps of uni q ue,y amp.*mg 
,fL "receptor sconces from DNA obtained from one or more indtvtduals, based on 
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pnmers designed according to the first 25 bases and the last 25 bases of any combination 
of or each of, SEQ ID NO: 153 through SEQ ID NO: 1084, and determining the 
similarities and differences between said amplified DNA and the corresponding receptor 
from SEQ ID NO: 153 through SEQ ID NO: 1084. 
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Brief Description of the Drawings 

Figure 1 depicts the isolated polynucleotide sequences, which encode polypeptides involved in 
10 olfactory sensation, corresponding to SEQ ID NOs: 1-73. 

Figure 2 depicts the isolated polynucleotide sequences, which encode polypept.des .nvolved m 
olfactory sensation, corresponding to SEQ ID NOs: 1 1 1 - 1 52. 



Petajled Description of the Invention 

The present invention provides isolated polynucleotides comprising sequences that 
encode polypeptides which are involved in olfactory sensation and which can be used to screen 
odorant ligands, e.g., odorant receptor agonists and antagon.sts. 



15 



20 Definitions , . , _ t 

term "olfac,ory receptor" (OR) refers ,o a polypeptide involved ,n olfactory 

sensation. An "olfactory recep,or polynucleotide" or "OR polynucleotide" is a polynucleonde 

encoding a polypeptide involved in olfactory sensation. 

The term "odorant ligand" as employed herein refers ,o a molecule that has the 

25 potential to bind to an olfactory receptor. Equivalent temts employed herein include "odoran, . 

Odorant molecule" and "odoran, compound". The term "binding" or "interacnon as used 

herein with respect to odoran, ligands refers ,o «he in,erac,ion of ligands wi,h me recep,or 

polypeptide where ,h= ligands may serve as either agonis,s and/or antagonists of a gtven 

Lpto, or recent function. An odoran, ligand may thus directly cause a percepuon of odor 

30 ,an agonist), o, may block the perception of odor (an antagonist, Au odoran. ligand may 

Llude, but is no. limited to, molecules which interact wUh polypnea .nvolved m olfac.ory 
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sensation. Odorant iigands and molecules which interact with olfactory receptors are generally 
small, approximately 1000 Daltons, more preferably approximately 750 Daltons, more 
preferably approximately 500 Daltons, or even more preferably approximately 300 Daltons, 
hydrophobic molecules with a variety of functional groups. Small changes in structure can 
induce profound changes in odorant ligand binding and hence in the odor perceived by an 
individual. 

A more detailed description of these sequences, as well as how these sequences were 
obtained, is provided below. 



As used herein, a "polynucleotide" is a polymeric form of nucleotides of any length, 
which contain deoxyribonucleotides, ribonucleotides, and/or their analogs. The terms 
"polynucleotide", "nucleotide" and "nucleic acid" as used herein are used interchangeably. 
Polynucleotides may have any three-dimensional structure, and may perform any function, 
1 5 known or unknown. The term "polynucleotide" includes double- , single-stranded, and triple- 
helical molecules. Unless otherwise specified or required, any embodiment of the invention 
described herein that is a polynucleotide encompasses both the double-stranded form and each 
of two complementary single-stranded forms known or predicted to make up the double 
stranded form. Not all linkages in a polynucleotide need be identical. 
20 The following are non-limiting examples of polynucleotides: a gene or gene fragment, 

exons, introns, mRNA, tRNA, rRNA, ribozymes, cDNA, recombinant polynucleotides, 
branched polynucleotides, plasmids, vectors, isolated DNA of any sequence, isolated RNA of 
any sequence, nucleic acid probes, primers, and adaptors. A polynucleotide may comprise 
modified nucleotides, such as methylated nucleotides and nucleotide analogs. The use of uracil 
25 as a substitute for thymine in a deoxyribonucleic acid is also considered an analogous form of 
pyrimidine. 

In the context of polynucleotides, a "linear sequence" or a "sequence" is an order of 
nucleotides in a polynucleotide in a 5' to 3' direction in which residues that neighbor each other 
in the sequence are contiguous in the primary structure of the polynucleotide. A "partial 
30 sequence" is a linear sequence of part of a polynucleotide which is known to comprise 
additional residues in one or both directions. 

If present, modification to the nucleotide structure may be imparted before or after 
assembly of the polymer. The sequence of nucleotides may be interrupted by non-nucleotide 
components. A polynucleotide may be further modified after polymerization, such as by 

10 
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conjugation with a labeling component. Other types of modifications included in this definition 
are for example, "caps", substitution of one or more of the naturally occurring nucleot.des w.th 
an analog, intemucleotide modifications such as, for example, those with uncharged hnkages 
(e g., methyl phosphonates, phosphotriesters, phosphoamidates, cabamates, etc.) and wuh 
5 charged linkages (e.g., phosphorothioates, phosphorodithioates, etc.), those containing pendant 
moieties, such as, for example, proteins (e.g., nucleases, toxins, antibodies, signal pept.des, 
poly-L-lysine, etc.), those with intercalators (e.g., acridine, psoralen, etc.), those containing 
chelators (e.g., metals, radioactive metals, boron, oxidative metals, etc.), those containing 
alkylators, those with modified linkages (e.g., a-anomeric nucleic acids, peptide nucleic acids, 
10 etc.), as well as unmodified forms of the polynucleotide(s). 

Further, any of the hydroxyl groups ordinarily present in the sugars may be replaced by 
phosphonate groups, phosphate groups, protected by standard protecting groups, or activated to 
prepare additional linkages to additional nucleotides, or may be conjugated to sohd supports. 
The 5' and 3' terminal OH groups can be phosphorylated or substituted with amines or organ.c 
15 capping group moieties of from 1 to 20 carbon atoms. Other hydroxyls may also be derivat,zed 

to standard protecting groups. 

Polynucleotides can also contain analogous forms of ribose or deoxyribose sugars that 
are generally known in the art, including, but not limited to, 2'-0-methyl-, 2'-0-allyl, T- 
fluoro- or 2'-azido-ribose, carboxcyclic sugar analogs, a-anomeric sugars, epimeric sugars such 
20 as arabinose, xyloses or lyxoses, pyranose sugars, furanose sugars, sedoheptuloses, acyclic 
analogs and abasic nucleoside analogs such as methyl riboside. 

Although conventional sugars and bases will be used in applying the method of the 
invention, substitution of analogous forms of sugars, purines and pyrimidines can be 
advantageous in designing a final product, as can alternative backbone structures like a 
25 polyamide backbone such as those used in peptide nucleic acids (PNAs). 

A polynucleotide or polynucleotide region has a certain percentage (for example, 75%, 
80%, 85%, 90%, 95% or 99%) of "sequence identity" to another sequence means that, when 
aligned, that percentage of bases are the same in comparing the two sequences. 

Homology, as described herein, means that the polypeptide sequences that are encoded 
30 by the nucleic acids demonstrate a certain relatedness (i.e., there exists regions of conserved 
amino acids), but not the same amino acid identity. There is complete or 1 00% homology at a 
particular amino acid residue when the amino acids of sequences being compared are the same 
(there is identity) or represent a conservative amino acid substitution (there is homology). A 
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"conservative amino aeid subs.Uu.ion" occurs when a parricular amino acid is substituted by an 
alurnau amino acid of similar charge density, hydrophobiciry/hydrophiliciry, she and/or 
configuu.ion (e.g., Val for lie). A "nonconservative amino acid substiu.ion" occurs when a 
particular amino acid is substituted by an alternative amino acid of differing properties, that ,s, 
5 charge density, hydrophobiciry/hydrophUiciry, size and/or configuration (e.g., Val for Tyr). 
The nucleie acid sequences within the scope of the present invention include those nucle,c 
acids which differ in exact sequence from .hose listed in SEQ .D NO:! through SEQ ID 
NO:73 and SEQ ID NO:l 11 through SEQ ID NOtl 52 but which encode idenucal or 
homologous polypeptide amino acid sequences. 
,0 A "primer" is a short polynucleotide, generally with a free V -OH group, .hat bmds to a 

target potentially present in a sample of intents, by hybridizing with the Urge., and thereafter 
promoting polymerization of a polynucleotide complementary to the target. 

An "adaptor" is a short, partially-duplexed polynucleotide that has a blunt, double- 
stranded end and a protruding, aingle-stranded end. 1. can be ligared, through its double- 
, 5 stranded end, to the double-stranded end of another polynucleotide. This provides known 
sequences at the ends of thus modified polynucleotides. Often adaptor contam specfic 
sequences for primer binding and/or restriction endonuclease digestion. 

A "probe" when used in ore context of polynucleotide manipulation refers to a 
polynucleotide which is provided as a reagen, to de.ec, a urge, po,en,ially present in a sample 
20 of inures, by hybridizing with the urge,. Usually, a probe will comprise . label or a means by 
which a label can be attached, either before or subsequen, to the hybridization reacuon. 
Soluble labels include, bu, are no, limited ,o radioisotopes, fluorochromes, chemiluminescen, 

compounds, dyes, and enzymes. 

•Transformation" or "transfection" refers to the insertion of an exogenous 
25 polynucleotide into a host ceil, irrespective of the method used for the insertion, for example, 
.ipofection, transduction, infection or electroporation. The exogenous polynucleotide may be 
m aintained as a non-integrated vector, for example, a plasmid, or alternatively, may be 

integrated into the host cell genome. 

A polynucleotide is said to "encode" a polypeptide if, in its native state or when 
30 manipulated by methods well known to those skilled in the art, it can be transcribed and/or 
translated to produce the polypeptide, a homologous polypeptide or a fragment thereof. For 
purposes of this invention, and to avoid cumbersome referrals to complementary strands, the 
anti-sense (or complementary) strand of such a polynucleotide is also said to encode the 
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sequence; that is, a polynucleotide sequence that "encodes" a polypeptide includes both the 
conventional coding strand and the complementary sequence (or strand). 

The terms "polypeptide", "oligopeptide", "peptide" and "protein" are used 
interchangeably herein to refer to polymers of amino acids of any length. The polymer may be 
linear or branched, it may comprise modified amino acids, it may be interrupted by non-amino 
acids, and it may be assembled into a complex of more than one polypeptide chain. The terms 
also encompass an amino acid polymer that has been modified naturally or by intervention; for 
example, disulfide bond formation, glycosylation, lipidation, acetylation, phosphorylation, or 
any other manipulation or modification, such as conjugation with a labeling component. Also 
included within the definition are, for example, polypeptides containing one or more analogs of 
an amino acid (including, for example, unnatural amino acids, etc.), as well as other 

modifications known in the art. 

In the context of polypeptides, a "linear sequence" or a "sequence" is an order of amino 
acids in a polypeptide in an N-terminal to C-terminal direction in which residues that neighbor 
each other in the sequence are contiguous in the primary structure of the polypeptide. A 
"partial sequence" is a linear sequence of part of a polypeptide which is known to comprise 
additional residues in one or both directions. 

"Recombinant," as applied to a polynucleotide or gene, means that the polynucleotide 
is the product of various combinations of cloning, restriction and/or ligation steps, and other 
procedures that result in a construct that is distinct from a polynucleotide found in nature. 

A "vector" is a self-replicating nucleic acid molecule that can be used to transfer an 
inserted nucleic acid molecule into and/or between host cells. The term includes vectors that 
function primarily for insertion of a nucleic acid molecule into a cell, vectors that function 
primarily for the amplification of nucleic acid, and expression vectors that function for 
transcription and/or translation of the DNA or RNA. Also included are vectors that provide 
more than one of the above functions. 

"Expression vectors" are defined as polynucleotides which, when introduced into an 
appropriate host cell, can be transcribed into a mRNA capable of being translated into a 
polypeptide(s). An expression vector also comprises control elements operatively linked to the 
coding region to enable and/or facilitate expression of the polypeptide in the target cell. These 
can include transcriptional, translation^, posttranscriptional, and posttranlational control 
elements, as are known in the art. An "expression system" usually connotes a suitable host cell 
comprised of an expression vector that can function to yield a desired expression product. 
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A "host cell" includes an individual cell or cell culture which can be or has been a 
recipient for vectors) or for incorporation of nucleic acid molecules and/or proteins. Host cells 
include progeny of a single host cell, and the progeny may not necessarily be completely 
identical (in morphology or in genomic or total DNA complement) to the original parent cell 
due to natural, accidental, or deliberate mutation. A host cell includes cells transfected in vivo 
with a polynucleotide(s) of this invention. 

A "cell line" or "cell culture" denotes eukaryotic cells, derived from higher, multicellular 
organisms, grown or maintained in vitro. It is understood that the descendants of a cell may not be 
completely identical (either morphologically, genotypically, or phenotypically) to the parent cell. 
Cells described as "uncultured" are obtained directly from a living organism, and are generally 
maintained for a limited amount of time away from the organism (i.e., not long enough or under 
conditions for the cells to undergo substantial replication). 

As used herein, "expression" includes transcription and/or translation. 

"Heterologous" means derived from (i.e., obtained from) a genotypically distinct entity 
from the rest of the entity to which it is being compared. For example, a polynucleotide may be 
placed by genetic engineering techniques into a plasmid or vector derived from a different 
source, thus becoming a heterologous polynucleotide. A promoter which is linked to a coding 
sequence with which it is not naturally linked is a heterologous promoter. 

An "isolated" or "purified" polynucleotide, polypeptide or cell is one that is 
substantially free of the materials with which it is associated in nature. By substantially free is 
meant at least 50%, preferably at least 70%, more preferably at least 80%, even more 
preferably at least 90%, even more preferably at least 99%, and even more preferably at least 
99.9% free of the materials with which it is associated in nature. As used herein, an "isolated" 
polynucleotide or polypeptide also refers to recombinant polynucleotides or polypeptides, 
which, by virtue of origin or manipulation: (1) are not associated with all or a portion of a 
polynucleotide or polypeptide with which they are associated in nature, (2) are linked to a 
polynucleotide or polypeptide other than that to which they are linked in nature, or (3) do not 
occur in nature, or (4) in the case of polypeptides, arise from expression of recombinant 
polynucleotides. Thus, for example, an isolated substance may be prepared by using a 
purification technique to enrich it from a source mixture. Enrichment can be measured on an 
absolute basis, such as weight per volume of solution, by specific activity or it can be measured 
in relation to a second, potentially interfering substance present in the source mixture. 
Increasing enrichments of the embodiments of this invention are increasingly more preferred. 
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Thus, for example, a 2-fold enrichment is preferred, 10-fold enrichment is more preferred, 100- 
fold enrichment is more preferred, 1000-fold enrichment is even more preferred. A substance 
can also be provided in an isolated state by processes such as chemical synthesis or 

recombinant expression. 

A "reagent" polynucleotide, polypeptide, or antibody, is a substance provided for a 
reaction, the substance having some known and desirable function in the reaction. A reaction 
mixture may also contain a "target", such as a polynucleotide, antibody, polypeptide, or 
assembly of polypeptides that the reagent is capable of reacting with. For example, in some 
types of diagnostic tests, the presence and/or amount of the target in a sample is determined by 
adding a reagent, allowing the reagent and target to react, and measuring the amount of reacfon 
product (if any). 

"Hybridization" refers to a reaction in which one or more polynucleotides react to form 
a complex that is stabilized via hydrogen bonding between the bases of the nucleotide residues. 
The hydrogen bonding may occur by Watson-Crick base pairing, Hoogstein binding, or in any 
other sequence-specific manner. The complex may comprise two strands forming a duplex 
structure, three or more strands forming a multi-stranded complex, a single self-hybridizing 
strand, or any combination of these. A hybridization reaction may constitute a step in a more 
extensive process, such as the initiation of an amplification reaction such as PCR, or the 
enzymatic cleavage of a polynucleotide by a ribozyme. 

When hybridization occurs in an antiparallel configuration between two single-stranded 
polynucleotides, those polynucleotides are described as "complementary". A double-stranded 
polynucleotide can be "complementary" to another polynucleotide if hybridization can occur 
between one of the strands of the first polynucleotide and the second. The degree to which one 
polynucleotide is complementary with another is quantifiable in terms of the proportion of bases in 
opposing strands that are expected to form hydrogen bonds with each other, according to generally 
accepted base-pairing rules of A-T, A-U and G-C. 

A "stable duplex" of polynucleotides, or a "stable complex" formed between any two 
or more components in a biochemical reaction, refers to a duplex or complex that is sufficiently 
long-lasting to persist between formation of the duplex or complex and subsequent detection, 
including any optional washing steps or other manipulation that may take place in the interim. 

A substance is said to be "selective" or "specific" if it reacts or associates more 
frequently, more rapidly, with greater duration and/or with greater affinity with a particular cell 
or substance than it does with alternative cells or substances. An odorant ligand "specifically 
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binds" to a target if it binds with greater affinity, avidity, more readily, and/or with greater 
duration than it binds to other substances. 

As used herein, "naturally occurring," "native," or "wild type" refers to endogenous 
polynucleotides and the protein(s) expressed thereby. These terms include full-length and 
processed polynucleotides and polypeptides. Processing can occur in one or more steps, and these 
terms encompass all stages of processing. For instance, polypeptides having or lacking a signal 
sequence are encompassed by the invention. "Non-natural ly occurring", "non-native", or "non- 
wild type" refer to all other polynucleotides and polypeptides. 

A "polymerase chain reaction" ("PCR") is a reaction in which replicate copies are made 
of a target polynucleotide using one or more primers, and a catalyst of polymerization, such as 
a reverse transcriptase or a DNA polymerase, and particularly a thermally stable polymerase 
enzyme. Methods for PCR are taught in U.S. Patent Nos. 4,683,195 (Mullis) and 4,683,202 
(Mullis et «!.). All processes of producing replicate copies of the same polynucleotide, such as 
PCR or gene cloning, are collectively referred to herein as "amplification." 

According to this invention, a "genomic DNA library" is a clone library which contains 
representative nucleotide sequences from the DNA of a given genome. It is constructed using 
various techniques that are well known in the art, for instance, by enzymatically or 
mechanically fragmenting the DNA from an organism, organ, or tissue of interest, linking the 
fragments to a suitable vector, and introducing the vector into appropriate cells so as to 
establish the genomic library. A genomic library contains both transcribed DNA fragments as 
well as nontranscribed DNA fragments. 

In comparison, a "cDNA library" is a clone library that differs from a genomic library 
in that it contains only transcribed DNA sequences and no nontranscribed DNA sequences. It 
is established using techniques that are well known in the art, i.e., selection of mRNA (e.g. by 
polyA) making single stranded DNA from a population of cytoplasmic mRNA molecules using 
the enzyme RNA-dependent DNA polymerase (i.e., reverse transcriptase), converting the 
single-stranded DNA into double-stranded DNA, cloning the resultant molecules into a vector, 
and introducing the vector into appropriate cells so as to establish the cDNA library. 
Alternately, a cDNA library need not be cloned into a vector and/or established in cells, but can 
be screened using PCR with gene-specific primers, as is well known in the art. 

An "individual" is a vertebrate, preferably a mammal, more preferably a human. 



General Techniques 
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The practice of the present invention will employ, unless otherwise indicated, 
conventional techniques of molecular biology (including recombinant techniques), 
microbiology, cell biology and biochemistry, which are within the skill of the art. Such 
techniques are explained fully in the literature, such as: "Molecular Cloning: A Laboratory 
Manual", second edition (Sambrook et al., 1989); "Oligonucleotide Synthesis" (M.J. Gait, ed., 
1984); "Animal Cell Culture" (R.I. Freshney, ed., 1987); "Methods in Enzymology" (Academi 
Press, Inc.); "Gene Transfer Vectors for Mammalian Cells" (J.M. Miller & M.P. Calos, eds., 
1987); "Current Protocols in Molecular Biology" (F.M. Ausubel et al., eds., 1987 and annual 
updates); "PCR: The Polymerase Chain Reaction", (Mullis et al., eds., 1994); "Current 
Protocols in Immunology" (J.E. Coligan et al., eds., 1991). 



Basis for identification and description of the polynucleotides and polypeptides 

The polynucleotide sequences were identified using oligonucleotide primers which 
were complementary to OR membrane-spanning regions. A number of different primers 
were used to elicit a variety of nucleotide sequences which encode polypeptides involved in 
olfactory sensation. The identification and isolation of nucleotide sequences which encode 
polypeptides involved in olfactory sensation and the polypeptides that they encode is vital 
for determining the response of receptors to odorant molecules, the elucidation of scent 
representations, profiles, or fingerprints, the reproduction of scent representations, profiles, 
or fingerprints and the editing of scent representations, profiles, or fingerprints. 

Polynucleotides encoding polypeptides involved in olfactory sensation 

The present invention provides isolated polynucleotides encoding polypeptides which 
are involved in olfactory sensation, vectors containing these polynucleotides, host cells 
containing these polynucleotides, and compositions comprising these polynucleotides. These 
polynucleotides are isolated and/or produced by chemical and/or recombinant methods, or a 
combination of these methods. The present invention includes polynucleotides isolated from 
the human olfactory epithelium which encode polypeptides which are involved in olfactory 
sensation, vectors containing these polynucleotides, host cells containing these polynucleotides, 
and compositions comprising these polynucleotides. Unless specifically stated otherwise, 
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"polynucleotides" shall include all embodiments of the polynucleotides of this invention. 
These polynucleotides are useful as probes, primers, in expression systems, and, in a preferred 
embodiment, in screening methods as described herein. In one embodiment the 
polynucleotides of the present invention can be isolated by creating a cDNA library using 
5 template RNA from human olfactory epithelium tissue. A detailed example is related in 
Example 1, below. 

The advantage of constructing a cDNA library for isolation of the desired nucleotide 
sequences is that the likelihood of obtaining pseudogenes is greatly reduced compared to using 
a genomic DNA library for the same purpose. cDNA libraries contain only mRNA expressed 

1 0 in the tissue used for the construction of the library, in this case, the human olfactory 

epithelium. The preferred olfactory epithelium tissue should express only those nucleotide 
sequences which are relevant for olfactory function, thereby excluding nonfunctioning 
pseudogenes and also GPCRs which may be similar in primary structure (amino acid sequence) 
but are not encoded in OSNs. As the number of GPCRs utilized in human signal transduction 

1 5 pathways is extremely wide and varied, cDNA libraries constructed using olfactory tissue are 
preferable for isolating nucleotide sequences that encode polypeptides which are involved in 
olfactory sensation, inasmuch as genomic libraries can contain abundant nucleotide sequences 
which encode for a variety of GPCRs performing numerous functions, and are likely to contain 
pseudogenes. 

20 The isolation of polynucleotide sequences which encode polypeptides involved in 

olfactory sensation is described in Example 1. Accordingly, this invention provides isolated 
polynucleotides that contain sequences encoding polypeptides or portions thereof which are 
involved in olfactory sensation, wherein the polypeptide is at least 10 amino acids in length, 
and wherein the polynucleotide sequences are depicted in SEQ ID NOs: 1-73 and SEQ ID 

25 NOs:lll-152. 

The invention includes modifications to said polynucleotides described above 
such as deletions, substitutions, additions, or changes in the nature of any nucleic acid moieties. 
A "modification" is any difference in nucleotide sequence as compared to a polynucleotide 
shown herein to encode a polypeptide involved in olfactory sensation, and/or any difference in 
30 the nucleic acid moieties of the polynucleotide(s), wherein such a modified polynucleotide 
encodes a polypeptide involved in olfactory sensation or a variant of said polypeptide that is 
useful in the practice of the invention. Such changes can be useful to facilitate cloning and 
modify expression of polynucleotides encoding polypeptides which are involved in olfactory 
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sensation. Such changes also can be useful for conferring desirable properties to the 
polynucleotide(s), such as stability. The definition of polynucleotide provided herein gives 
examples of these modifications. Hence, the invention also includes variants of the nucle.c 
acid sequences disclosed herein, which include nucleic acid substitutions, additions, and/or 
deletions. 

The invention also encompasses polynucleotides encoding polypeptides involved in 
olfactory sensation, including polynucleotides that are full-length, processed, coding, non- 
coding (including flanking region) or portions thereof, provided that these polynucleotides 
contain a region encoding at least a portion of a polypeptide involved in olfactory sensat.on. 
(That is, the region encodes a functional fragment of an olfactory receptor or other polypept.de 
involved in olfactory sensation.) Also embodied are the mRNA, cDNA and genomic DNA 
sequences and fragments thereof that include a polynucleotide sequence comprising a coding 
sequence for a portion of a polypeptide involved in olfactory sensation. 

Genes encoding human olfactory receptors, and optionally including related genom.c 
sequences such as regulatory sequences, can be obtained using olfactory receptor cDNAs as 
hybridization probes. Under high stringency hybridization conditions, an OR cDNA w.ll 
hybridize to its cognate OR gene. Use of lower stringency hybridization conditions allows the 
isolation of OR genes that are related to, but not identical with, the gene corresponding to a 
particular OR cDNA. 

20 Conditions for hybridization are well-known to those of skill in the art and can be 

varied within relatively wide limits. Hybridization stringency refers to the degree to which 
hybridization conditions disfavor the formation of hybrids containing mismatched nucleotides, 
thereby promoting the formation of perfectly matched hybrids or hybrids containing fewer 
mismatches; with higher stringency correlated with a lower tolerance for mismatched hybnds. 
25 Factors that affect the stringency of hybridization include, but are not limited to, temperature, 
P H ionic strength, and concentration of organic solvents such as formamide and 
dimethylsulfoxide. As is well known to those of skill in the art, hybridization stringency is 
increased by higher temperatures and/or lower ionic strengths. See, for example, Ausubel et 
at., supra; Sambrook et al., supra; M.A. Innis et al. (eds.) PCR Protocols, Academic Press, San 
30 Diego, 1 990; B.D. Hames et al. (eds.) Nucleic Acid Hybridisation: A Practical Approach, IRL 
Press, Oxford, 1985; and van Ness et al., (1991) Nucleic Acids Res. 19:5143-5151. The 
degree of stringency can be adjusted not only during a hybridization reaction, but also in post- 
hybridization washes, as is known to those of skill in the art. 
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The invention also encompasses polynucleotides encoding polypeptides involved in 
olfactory sensation, functionally equivalent variants and derivatives of full-length polypeptides 
involved in olfactory sensation and functionally equivalent fragments. For instance, changes in 
a DNA sequence that do not change the encoded amino acid sequence, as well as those that 
result in conservative substitutions of amino acid residues, non-deleterious non-conservative 
substitutions, one or a few amino acid deletions or additions, and substitution of amino acid 
residues by amino acid analogs, will not significantly affect properties of the encoded 
polypeptide. Polypeptides homologous to the polypeptides encoded by the polynucleotides 
described herein can also be identified using algorithms and methods well-known to those of 
skill in the art, such as those described in Ausubel, "Current Protocols in Molecular Biology," 
Chapter 19; see also Altschul, S.F., Gish, W., Miller, W., Myers, E.W. & Lipman, D.J. (1990) 
"Basic local alignment search tool." J. Mol. Biol. 215:403-410; Gish, W. & States, D.J. (1993) 
"Identification of protein coding regions by database similarity search." Nature Genet. 3:266- 
272; Madden, T.L., Tatusov, R.L. & Zhang, J. (1996) "Applications of network BLAST 
server" Meth. Enzymol. 266:131-141; Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., 
Zhang, Z., Miller, W. & Lipman, D.J. (1997) "Gapped BLAST and PS1-BLAST: a new 
generation of protein database search programs." Nucleic Acids Res. 25:3389-3402; and 
Zhang, J. & Madden, T.L. (1997) "PowerBLAST: A new network BLAST application for 
interactive or automated sequence analysis and annotation." Genome Res. 7:649-656. A 
preferred method of determining homology is the BLAST set of similarity search programs 
(Altschul, S.F., Gish, W., Miller, W., Myers, E.W. & Lipman, D.J. (1990) "Basic local 
alignment search tool." J. Mol. Biol. 21 5:403-410. Polypeptides which are 40% homologous, 
50% homologous, 60% homologous, 70% homologous, 80% homologous, 90% homologous, 
95% homologous, or 99% homologous to the polypeptides encoded by the polynucleotides 
described herein are encompassed by the invention. 

• Nucleotide substitutions that do not alter the amino acid residues encoded can be useful 
for optimizing gene expression in different systems. Suitable substitutions are known to those 
of skill in the art and are made, for instance, to reflect preferred codon usage in the particular 
expression systems. In another example, alternatively spliced polynucleotides can give rise to 
different functionally equivalent fragments or variants of an polypeptide involved in olfactory 
sensation. Alternatively processed polynucleotide sequence variants are defined as 
polynucleotide sequences corresponding to mRNAs that differ in sequence from one another 
but are derived from the same genomic region, for example, mRNAs that result from: 1) the 
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use of alternative promoters 
alternative splice sites. 



; 2) the use of alternative polyadenylation sites; and/or 3) the use of 



Preparation of polynucleotides involved in olfactory sensation 
, The polynucleotides of this invention can be obtained using chem.ca. synthes.s, 

recombinant methods, or PCR. 

Methods of chemical polynucleotide synthesis are well known in the art and need no. 
be described in detail herein. One of skill in the at, oan use the sequences provided heretn and 
a commercial DNA synthesizer to produce a desired DNA sequence. 
0 For preparing polynucleotides which encode polypeptides involved in olfactory 

sensation using recombinant methods, a polynucleotide comprising a desired sequence can be 
inserted into a suitable vector, and me vector in htm can be introduced into a suttable host cell 
for replication and ampliation. Polynucleotides may be inserted into host cells by any means 
known in the art. Cell, are transformed by introducing an exogenous polynucleor.de by dtrec. 
5 uptake, endocytosi, transfecdon, F-mating, particle bombardment, liposome mediation, or 
electroporation. Once introduced, an exogenous polynucleotide can be maintained wtth.n the 
cel, as a non-integtated vector (such as a plasmid) or integrated into the host cell genome. The 
polynucleotide encoding a polypeptide involved in olfactory nation can be isolated from the 
host cell by methods well known within the art. See, e.g., Sambrook e. 1. (1989). 
20 Alternatively, PCR allows amplification of DNA sequence, PCR technology ts well 

know, in the art and is described in U.S. Pa,. No, 4,683,195, 4,800,1 59, 4,754,065 and 
4,683,202, as well as PCR: The Polymerase Cham fraction, Mollis ct al. ed,, Btrkhauaw 

Press, Boston (1994). 

RNA can be obtained in a number of ways in an appropriate vector and the vector ,s 

25 transformed into a suitable host cell. When the inserted DNA is inscribed into RNA the 
RNA can then be isolated using methods well known to those of skill in .he art, as se, forth . 
Sambrook e, al., (.989), for example. RNA can also be obtained through in vitro teacon, 
For example, .he polynucleotide, which encodes a polypeptide involved in olfactot, sensatton, 
can be inserted into a vector drat contains appropriate inscription promoter sequences. 

30 Commercially available RNA polymerases will specifically initiate transcription a. thetr 
promoter sites and continue .he transcription process through the adjoimng DNA 
polynucleotide, Placing the polynucleotide sequences which encode polypeptides involved ,n 



21 



WO 01/27158 



PCT/US00/27582 



olfactory sensation between two such promoters allows the generation of sense orantisense 
strands of desired RNA. 



Cloning and expression vectors comprising polynucleotide sequences encoding polypeptides 
5 involved in olfactory sensation 

The present invention further includes a variety of vectors containing polynucleotides 

encoding polypeptides involved in olfactory sensation. These vectors can be used for 

expression of recombinant polypeptides as well as a source of polynucleotides which encode 

polypeptides involved in olfactory sensation. Cloning vectors can be used to obtain replicate 

10 copies of the polynucleotides, which encode polypeptides involved in olfactory sensation, they 
contain, or as a means of storing the polynucleotides in a depository for future recovery. 
Expression vectors (and host cells containing these expression vectors) can be used to obtain 
polypeptides produced from the polynucleotides they contain. Suitable cloning and expression 
vectors include any known in the art, e.g., those for use in in vitro, bacterial, mammalian, yeast 

1 5 and insect expression systems. Specific vectors and suitable host cells are known in the art and 
need not be described in detail herein. For example, see Gacesa and Ramji, Vectors, John 
Wiley & Sons (1994). 

Cloning and expression vectors typically contain a selectable marker (for example, a 
gene encoding a protein necessary for the survival or growth of a host cell transformed with the 

20 vector), although such a marker gene can be carried on another polynucleotide sequence co- 
introduced into the host cell. Only those host cells into which a selectable marker has been 
introduced will survive and/or grow under selective conditions. Typical selectable markers 
encode protein(s) that (a) confer resistance to antibiotics or other toxins substances, e.g., 
ampicillin, neomycin, methotrexate, etc.; (b) complement auxotrophic deficiencies; or (c) 

25 supply critical nutrients not available from complex media. The choice of the proper marker 
gene will depend on the host cell, and appropriate genes for different hosts are known in the art. 
Cloning and expression vectors also typically contain a replication system recognized by the 
host. 

Suitable cloning vectors may be constructed according to standard techniques, or may 
30 be selected from a large number of cloning vectors available in the art. While the cloning 

vector selected may vary according to the host cell intended to be used, useful cloning vectors 
will generally have the ability to self-replicate in an appropriate host, may possess a single 
target for one or more particular restriction endonucleases, and/or may carry genes for a marker 
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that can be used in selecting clones containing the vector. Suitable examples include plasmids 
and bacterial viruses, e.g., pUC18, pUC19, ml3mpl8, ml3mpl9, pBR322, pMB9, ColEl, 
pCRl, RP4, phage DNAs, and shuttle vectors such as pSA3 and pAT28. These and many other 
cloning vectors are available from commercial vendors such as BioRad, Stratagene, and 
Invitrogen. 

Expression vectors generally are replicatable polynucleotide constructs that contain a 
polynucleotide encoding an polypeptide involved in olfactory sensation of interest. The 
polynucleotide, which encodes a polypeptide involved in olfactory sensation, encoding the 
polypeptide is operatively linked to suitable transcriptional controlling elements, such as 
promoters, enhancers and terminators. For expression (i.e., translation), one or more 
translational controlling elements are also usually required, such as ribosome binding sites, 
translation initiation sites, and stop codons. These controlling elements (transcriptional and 
translational) may be derived from the gene encoding polypeptides involved in olfactory 
sensation, or they may be heterologous (i.e., derived from other genes and/or other organisms). 
A polynucleotide sequence encoding a signal peptide can also be included to allow a 
polypeptide involved in olfactory sensation to cross and/or lodge in cell membranes or be 
secreted from the cell. A number of expression vectors suitable for expression in eukaryotic 
cells including yeast, insect, avian, plant and mammalian cells are known in the art. Common 
vectors, such as YEpl3 and the Sikorski series pRS303-306, 313-316, 423-426 can also be 
used. Vectors pDBV52 and pDBV53 are suitable for expression. Another example of an 
expression vector/host cell system is the baculovirus (e.g., nuclear polyhedrosis virus)/insect 

cell (e.g., sf9 cells) system. 

Human olfactory receptor polypeptides are expressed from olfactory receptor cDNA by 
methods well-known to those of skill in the art. A cDNA or portion thereof is inserted in an 
expression vector using standard molecular cloning techniques. Coupled in vitro transcription 
and translation of such a vector results in expression of the OR protein encoded by the cDNA. 
In vivo expression of a OR polypeptide is accomplished by inserting an OR cDNA into a 
eucaryotic or procaryotic expression vector, of which many are known in the art, to genereate 
an OR expression construct. The OR expression construct is introduced into an appropriate 
host cell in which the OR sequences are expressed (by transcription and translation) and 
optionally secreted, and the expressed OR polypeptide is obtained from the cell growth medium 
and/or from cell lysates. 
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A number of expression vectors are known in the art. Prokaryotic expression vectors 
include, but are not limited to, T7 RNA polymerase/T7 promoter-based vectors, 
bacteriophage X-based vectors and various types of fusion vectors. Fusion vectors include, but 
are not limited to, lacZ and trpE fusion vectors, maltose binding protein flision vectors, 

5 glutathione-S-transferase fusion vectors, and thioredoxin fusion vectors. Baculovirus-based 
vectors are used for expression in insect cell systems. Expression in mammalian cells (such as 
HEK, COS and CHO cells) utilizes vectors containing a mammalian origin of replication (such 
as, for example, a SV40 origin), an efficient promoter (optionally including one or more 
enhancer sequences), mRNA processing signals (e.g., splice sites and polyadenylation sites), 

1 0 one or more selectable markers, and optionally a prokaryotic replicon to allow propagation and 
manipulation of the construct in prokaryotic cells. Alternatively, expression in mammalian 
cells is achieved through the use of any of a number of mammalian viral vectors including, but 
not limited to, retroviruses, lentiviruses, Semliki Forest viruses, vaccinia viruses, adenoviruses 
and adeno-associated viruses. 

1 5 Vectors containing the polynucleotides of interest can be introduced into the host cell 

by any of a number of appropriate means, including electroporation, direct injection, 
transfection employing calcium chloride, rubidium chloride, calcium phosphate, DEAE- 
dextran, or other substances; microprojectile bombardment; lipofection; and infection (where 
the vector is an infectious agent, such as a virus). The choice of means of introducing vectors 

20 or polynucleotides encoding polypeptides involved in olfactory sensation will often depend on 
the host cell, as will be well known to those of skill in the art. 

Host cells transformed with polynucleotides encoding polypeptides involved in olfactory 
sensation 

25 Another embodiment of this invention are host cells transformed with (i.e., comprising) 

polynucleotides encoding polypeptides involved in olfactory sensation, and/or vectors having 
polynucleotide(s) sequences encoding polypeptides involved in olfactory sensation, as 
described above. Both prokaryotic and eukaryotic host cells may be used. Prokaryotic hosts 
include bacterial cells, for example E. coli, B. subtilis, and mycobacteria. Among eukaryotic 

30 hosts are yeast, insect, avian, plant and mammalian cells. Host systems are known in the art 
and need not be described in detail herein. 

The host cells of this invention can be used, inter alia, as repositories of 
polynucleotides encoding polypeptides involved in olfactory sensation, and/or vehicles for 
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production of polynucleotides encoding polypeptides involved in olfactory sensation, and/or 
polypeptides involved in olfactory sensation . They may also be used as vehicles for in vivo 
delivery of polypeptides involved in olfactory sensation . 

Uses/or and methods using polynucleotides encoding polypeptides involved in olfactory 

sensation . 
To determine whether a vector containing polynucleotides is capable of expressing in 

eukaryotic cells, cells such as, for example, COS-7 (primate origin), CHO (rodent origin), 

HEK-293 (human origin), or HeLa (human origin) cells can be transfected with the vector. 

Expression of a polypeptide(s) encoded by the vector is then determined by, for example, RIA, 

ELISA, immunofluorescence of fixed cells, or western blotting of cell lysate using an antibody 

as a probe. Antibodies can be obtained using, as immunogen, peptide sequences synthesized 

from the protein sequences encoded by the known polynucleotide sequence. Polypeptides can 

be purified by, for example, phase partitioning, affinity methods, gel filtration and ion 

exchange, as well as additional methods known by those skilled in the art. Further 

characterization of the expressed polypeptide can be achieved by purification of the 

polypeptide using techniques known in the art. 



Polypeptides involved in olfactory sensation 

The present invention encompasses polypeptides involved in olfactory sensation. Expression of 
said polypeptides is localized in the olfactory neurons located in the olfactory epithelium, as 
described earlier. The polypeptides may comprise any novel sequence encoded by a nucleotide 
sequence as depicted in SEQ ID NO:l through SEQ ID NO:73 and SEQ ID NO:l 1 1 through SEQ 
IDNO:152. 

The invention includes modifications to polypeptides involved in olfactory sensation 
including functionally equivalent fragments of the polypeptides involved in olfactory sensation 
which do not significantly affect their properties and variants which may have enhanced or 
decreased activity. Collectively, these modifications may be termed "analogs" of or a fragment of 
polypeptides involved in olfactory sensation. Modification of polypeptides is routine practice in 
the art and need not be described in detail herein. Examples of modified polypeptides include 
polypeptides with conservative substitutions of amino acid residues, one or more deletions or 
additions of amino acids which do not significantly deleteriously change the functional activity, 
or use of chemical analogs. Amino acid residues which can be conservatively substituted for 
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one another inelude bu. are nor limited to: glycine/alanine; valine/isoleucine/leucine; 
asparagine/glutamine; aspanic acid/g.utamic acid; serine/threonine; lysWarginine; and 
phenylalanine/ryrosine. Such conservative substitutions are known in tbe art, and prefembly, 
the antino acid substitutions would be such that the substituted amino acid would possess 
5 similar chemical oroperries as that of the origina, amino acid. These polypeptide, also mclude 
glycosylated and non-glycosylated polypeptides, as well as polypeptides with other post- 
national modifications, such as, for example, glycosylate with different sugars, 
acetylation, and phosphorylation. Amino acid modifications c range from changing or 
modifying one or « amino acids to complete redesign of a region. Other methods of 
,0 modification include using coupling technics known in the art. including, but no, m,,ed to, 
enzymatic means, oxidative substitution and chelation. Modified polypeptides mvolved m 
olfactory sensation are made using established procedures in the art. 

The invention also encompasses fusion proteins comprising one or more polypepttdes 
involved in olfactory sensation. For purposes of this invention, an fusion protein contains one 
,5 or more polypeptides involved in olfactory sensation and anorher amino acid sequence 
U is not Lched in rhe native molecule, for example, a heterologous sequence or a 
sequence from another region. Useful heterologous sequences include, bu, are no, hmrted ,o, 
sequences that provide for secretion from a bos, cel., intracellular trafficking, and 
stability/degradation. Odter useful heterologous sequences are ones which facluate 
20 purification. Examples of such sequences are know, in me an and include those encodmg 
epitopes such as Myc, HA (derived from influenza vims hemagglutinin), H, S -6, or FLAG. 
Cher heterologous sequences dta, facilitate purification are derived from proteins such as 
glutathione S -transferase (GST,, maltose-binding protein (MBP), or the Fc portton of 
immunoglobulin. 

25 

Preparation of polypeptides involved in olfactory sensation 

The polype^ides of this invention can be made by procedures known ,n the art. The 
polypeptides can be produced by recombinant methods (i.e., sing, or fusion polypeptides) or 
by chemical synthesis. Po.ypeptides, especially shorter polypeptides up to about 50 ammo 
30 adds, are conveniently made by chemical synthesis. Methods of chemical synthes.s are known 
intheartandarecommercia.lyavai.able. For example, a polypeptide can be produced by an 
automated polypeptide synthesizer employing the solid phase method. Po.ypept.des can also 
be made by chemical synthesis using techniques known m the art. 
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Polypeptides can also be made by expression systems, using recombinant methods. 
The availability of polynucleotides encoding polypeptides permits the construction of 
expression vectors encoding intact (i.e., native) polypeptide, functional equivalents and 
functional fragments thereof, modified forms or recombinant forms. A polynucleotide 
5 encoding the desired polypeptide, or a fusion protein, can be ligated into an expression vector 
suitable for any convenient host. Both eukaryotic and prokaryotic host systems can be used. 
The polypeptide is then isolated from lysed cells or from the culture medium and purified to the 
extent needed for its intended use. Purification or isolation of the polypeptides expressed in 
host systems can be accomplished by any method known in the art ( e.g. partitioning exclusion, 

10 ion exchange chromatograph, gel filtration, etc.). Other controlling transcription or translation 
segments, such as signal sequences that direct the polypeptide to a specific cell compartment 
(i.e., for secretion), can also be used. Examples of prokaryotic host cells are known in the art 
and include, for example, E. coli and B. subtilis. Examples of eukaryotic host cells are known 
in the art and include yeast, avian, insect, plant, and animal cells such as COS7, HeLa, CHO, 

1 5 HEK-293 and other mammalian cells. 

Alternatively, in vitro expression systems may also be used to produce 
polypeptides involved in olfactory sensation. A plasmid containing a polynucleotide encoding 
polypeptides involved in olfactory sensation, under the control of an appropriate promoter, can 
be transcribed and the resultant RNA translated in vitro through the use of commercially 

20 available reagents. Such methods can be used to produce relatively pure samples of the 
polypeptide and are known in the art. 

Preferably, the polypeptides are at least partially purified from other cellular 
constituents. In one embodiment, the polypeptides are at least 70%, more preferably at least 
80%, even more preferably at least 90% or most preferably at least 95% pure. In this context, 

25 purity can be calculated as a weight percent of the total protein content of the preparation. 
More highly purified polypeptides may also be obtained and are encompassed by the present 
invention. Methods of protein purification are known in the art and are not described in detail 
herein. For membrane-bound proteins, the lipid content of the preparation, which is required to 
maintain the structure and function of the protein, is excluded from the purity calculation. That 

30 is, if a preparation weighing 10 mg has 5 mg lipid, 4 mg of desired protein, and 1 mg of 
undesired proteins, the purity is calculated as 80% (desired protein content divided by total 
protein content). Preparations of biological or synthetic molecules suitable for maintaining 
structure and function of membrane proteins are described in Etemadi AH (1985) Adv Lipid 
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Res 1985;21:281-428; Villalobo A (1990) Biochimica Et Biophysica Acta, 1017(l):l-48; 
Montal M (1987) Journal Of Membrane Biology 98(2): 101-115; Scotto AW et al. (1 987) 
Biochemistry 26(3): 833-839; Jain MK and Zakim D (1987) Biochimica Et Biophysica Ac 
906(1): 33-68; Czerski L and Sanders CR (2000) Anal Biochem 284(2):327-33 (lipid- 
5 detergent mixtures or "bicelles"); Hrafhsdottir S and Menon AK (2000) J Bacteriol 

182(15):4198-206 (proteoliposomes); Puu G et al. (2000) Biosens Bioelectron 15(1-2):31- 
41 (protein-lipid preparations on solid surfaces); Schafrneister CE et al. (1993) Science 
262(5 1 34):734-8 ("peptitergents"). 



1 0 Uses of polypeptides involved in olfactory sensation 

The polypeptides of this invention have a variety of uses. They can be used, for example, 
to screen odorant ligands in order to determine the scent representations, scent profiles or scent 
fingerprints of particular odorant molecules and further to characterize the effect of functional 
groups and chemical characteristics on perceived smell. Methods for screening odorant 

1 5 compounds using odorant receptors in neuronal cells are known in the art (Firestein et al., WO 

98/50081 ; Duchamp-Viret etal, Science 1999, 284 2171-2174; Sato et al, J. Neurophys. 1994 72 
2980-2989; Malnic et al. Cell 1999 96 713-723; Zhao et al., Science 1998 279, 237-242). There 
are also methods which can be employed to screen odorant compounds which do not require 
neuronal cells and are known in the art (Kauvar et al., U. S. Pat. No. 5,798,275; Kiefer et al., 

20 Biochemistry 1996 35 16077-16084; Krautwurst et al, Cell 1998 95 917-926), 

Analysis of the scent can be performed in a number of ways. Various embodiments of 
the scent analysis system are presented. Examples of how these embodiments might operate 
are also presented, although it should be emphasized that the invention is not limited by any 
25 particular theory of olfactory perception or scent analysis. 



Olfactory Space 

^ The sensory subsystem comprises a series of olfactory receptors, which selectively bind 
with the chemical component(s) making up the scent. The scent can be characterized in terms 
30 of which of the approximately 1,000 olfactory receptors the scent components) bind to, and the 
strength of the interaction of the component(s) with those receptors. Each olfactory receptor 
can be considered an orthogonal basis vector; the entire set of olfactory receptors can be 
considered a set of basis vectors spanning "olfactory space." This is analogous to vectors 
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pointing along the x, y, and z directions in three-dimensional space, where any point in space 
can be represented by a combination of the x, y, and z basis vectors (with each of the x, y, and z 
vectors multiplied by the appropriate scalar quantity). The intensity of interaction of a scent 
with an olfactory receptor determines the magnitude of the vector along that particular "axis" in 
5 olfactory space. Thus, every scent can be uniquely described by a vector representation in 
olfactory space. 

A representation of a scent in such a manner that the scent can later be re-created is 
defined as scent profiling. The aforementioned vector representation is one example of a scent 
profile. 

10 

Primary Scents 

For the purposes of this invention, a receptor primary scent component is defined as a 
chemical that interacts with one and only one scent receptor. A receptor complex scent 
component is defined as a chemical that interacts with more than one scent receptor; the 
1 5 receptor complex scent component can interact with each of the scent receptors to different 
degrees, to equal degrees, or can interact with some receptors to the same degree and others to 
different degrees. 

Olfactory receptors are proteins which fall in the class of seven transmembrane domain 
G protein-coupled receptors, and are found in olfactory neurons in vivo. Binding of an odorant 

20 to an olfactory receptor causes second messenger systems to become activated or inhibited in 
the cell, leading to increased cellular production of second messenger molecules such as cyclic 
AMP. These second messenger systems in turn lead to the depolarization of the olfactory 
neuron, or other changes in the state of the neuron, which provides the signal to the nervous 
system that the odorant has been detected. 

25 With a complete set of receptor primary scent components, any scent can be re-created 

with the knowledge to the degree to which it interacts with each olfactory receptor. The instant 
invention encompasses such complete sets of receptor primary scent components. Other 
embodiments of the invention encompass sets of receptor primary scent component chemicals 
which provide the ability to re-create a particularly desired subset of scents, but not necessarily 

30 all possible scents. Still more embodiments encompass sets of receptor primary scent 

component chemicals which provide the ability to approximate particular scents, while not 
necessarily exactly re-creating the interaction profile of the particular scents. 
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In some cases, a receptor complex scent will be an acceptable approximation to a 
receptor primary scent. That is, if a given receptor complex scent interacts with a first scent 
receptor strongly, but interacts with other scent receptors less strongly, it can be considered an 
approximation to a receptor primary scent component for the first receptor. Such a receptor 
5 complex scent component is described by the term receptor quasi-primary scent component. 
One embodiment of the invention encompasses sets of receptor quasi-primary scent component 
chemicals suitable for re-creating all scents. Another embodiment of the invention 
encompasses sets of receptor quasi-primary scent component chemicals suitable for re-creating 
a particularly desired subset of scents, but not necessarily all possible scents. Yet another 
10 embodiment encompasses sets of receptor quasi-primary scent component chemicals which 
provide the ability to approximate particular scents, while not necessarily exactly re-creating 
the interaction profile of the particular scents. 

The identification of receptor primary or quasi-primary scent component 
chemicals provides the most conceptually straightforward method of re-creating scents. 
15 However, another embodiment of the invention encompasses the use of receptor complex 
scent components for re-creating scents. An example of such an embodiment would be re- 
creation of a scent that activates olfactory receptors designated OR1, OR2, OR3, OR4, 
OR5 and OR6 (for the sake of illustration, it is assumed that the olfactory receptors are 
stimulated to an equal extent). If one is in possession of two receptor complex scent 
20 component chemicals (RCSC's) where RCSC1 activates OR1 and OR5, and RCSC2 

activates OR2, OR3, OR4, and OR6, then one can reproduce the original scent by mixing 
RCSC1 and RCSC2 to re-create the original olfactory receptor activation profile. In 
practice, the profiles of various receptor complex scent components will be much more 
complicated than the forgoing example, and components which inhibit olfactory activation 
25 as well as stimulate activation can be included in the sets. However, once receptor 

activation profiles of sufficient receptor complex scent components are known, computer 
algorithms can be utilized to create the appropriate combination of receptor complex scent 
components. Using vector representations of the olfactory receptor activation profiles for a 
set of receptor complex scent components, one can create linear combinations of such 
30 receptor complex scent components in order to represent a particular scent. For the 

example given above, such a vector representation would look like (1, 0, 0, 0, 1, 0) for the 
first receptor complex scent component and (0, 1, 1, 1, 0, 1) for the second receptor 
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complex scent component, while the vector representation of the scent to be re-created is 
(1,1,1,1,1,1). If x i and X2 are the relative proportions of the first receptor complex scent 
component and the second receptor complex scent component, respectively, to be 
combined to re-create the scent, then the problem can be represented as a series of linear 
5 equations: 

IX, + 0X 2 = 1 
0X1 + 1X2 = 1 
OXi + lX 2 = 1 
OX, + 1X2 - 1 
lXl + 0X2 = 1 
OX, + 1X2 = 1 



and the solutions for x, and x 2 are x, = 1 , x 2 = 1 . Solutions to systems of linear equations 
have been thoroughly studied and many algorithms are available for implementation on 
computers, including algorithms which evaluate the accuracy of an approximate solution 

1 0 when an exact solution cannot be determined. (See, e.g., Dettman, J. W., Introduction to 
Linear Algebra and Differential Equations, Dover Pubs., 1986; Press W.H. et al., 
Numerical Recipes in C: The Art of Scientific Computing, 2nd ed., Cambridge University 
Press, 1993; Vetterling (ed.) Numerical Recipes in C: The Art of Scientific Computing/Disk 
V2.02, Cambridge University Press, 1997.) These methods can also be used to determine 

1 5 whether a set of receptor complex scent components is suitable for re-creating a given 

scent. For example, if the scent to be recreated is represented by the vector (1,1,1,1,1,2), 
there will be no solution to the resulting system of linear equations using the two receptor 
complex scent components in the illustration above. In this instance, one or more 
additional receptor scent components will need to be identified in order to be able to re- 

20 create the scent in terms of the receptor primary scent components. Alternatively, the scent 
represented by (1, 1, 1, 1, 1, 1) may be an acceptable approximation to the scent 
represented by (1, 1, 1, 1, 1,2). Integers are used in this example for clarity, but the vectors 
can contain any real number representing a measured intensity; for example, (1.1, 0.997, 
1.08, 1.2, 0.88888..., 2.00001) may be an acceptable approximation to the scent 

25 represented by (1 , 1 , 1 , 1 , 1 , 2). 
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It will be readily appreciated that the choice of a complete set of receptor primary, 
quasi-primary, or complex scent component chemicals (capable of generating all scents) versus 
a partial set of receptor primary, quasi-primary, or complex scent component chemicals 
(capable of generating, exactly or approximately, a subset of scents) depends on the application 
for which scent re-creation is desired. 

A special category of receptor scent components are chemicals which bind to a 
receptor without activating it. If these non-activating chemicals prevent chemicals which 
do activate the receptors from binding, the non-activating chemicals act to "turn off those 
receptors. These non-activating chemicals, or receptor binding antagonists, are particularly 
useful in editing scents, as they can be added to a scent to attenuate or eliminate particular 
aspects of the scent. In the vector example above, if a particular receptor antagonist blocks 
OR2, OR3, and OR4, but not OR1, OR5 or OR6, then it can be represented in vector 
format as (0,-1, -1,-1, 0,0). In the reproduction of (1, 1, 1, 1, 1 , 2) from the vectors 
(1, 0, 0, 0, 1, 0) and (0, 1, 1, 1, 0, 1), the following combination can be used: 
1 x (1, 0, 0, 0, 1, 0) + 2 x (0, 1, 1, 1, 0, 1) + 1 x (0, -1, -1, -1, 0, 0) to yield the vector 
(1,1,1,1,1,2). In some instances, enough of a particular receptor binding antagonist is 
used to eliminate any possibility of activation by a receptor scent component, in which case 
the vector entry for the receptor(s) which are blocked by that antagonist contains 0 in the 
vector position corresponding to that receptor(s). 

Perceptive primary scents are defined as scents that give a single scent perception, for 
example, the scent "lemon" as perceived by a human. A perceptive primary scent can be 
composed of one or more receptor primary scent components, one or more receptor complex 
scent components, or a mixture of one or more receptor primary scent components and one or 
more receptor complex scent components. Since perceptive primary scents are to some extent 
subjective, identification of perceptive primary scents can be performed by using a panel of 
subjects who evaluate and describe scents. A perceptive complex scent is made up of more 
than one perceptive primary scent. The boundaries between a perceptive primary scent and a 
perceptive complex scent are also to some extent subjective; for example, one person may 
describe a scent as "pizza," while another person may describe the same scent as "sausage, 
cheese and tomato sauce." That is, one person may perceive a scent as a perceptive primary 
scent for "pizza," while another person may perceive the same scent as a perceptive complex 
scent made up of several individual perceptive primary scents. In order to standardize 
perceptive scents, a panel of five or more, preferably ten or more, more preferably fifty or 
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more, still more preferably one hundred or more, people can be surveyed to label various 
perceptive scents. When a plurality, preferably a majority, more preferably 66 2/3 % or 
greater, still more preferably 95 % or greater, even more preferably 99% or greater, of the panel 
identif.es a scent as the same scent (e.g., of a panel of 100 people, 95 describe a scent as 
"pizza," while the other 5 describe the scent otherwise), the scent can be labeled as a perceptive 
scent (the perceptive scent can be primary or complex, depending on whether the panel 
identifies it as a single scent or a mixture of scents). 

In fields where existing classification schemes already exist, the perceptive primary and 
complex scents can be indexed according to those schemes. For example, the SFP (Societe 
Francaise des Parfumeurs) has drawn up a classification system based on 5 main groups, sub- 
divided into classes. Such a classification can be used for selecting perceptive primary scents 
and used as guides for combining the scents. 

Selecting Chemicals for Scent Re-creation 

A scent which has been represented as a set of basis vectors in olfactory space can in 
principle be re-created simply by mixing the receptor primary scent components, receptor 
quasi-primary scent components, or receptor complex scent components needed to interact the 
olfactory receptors in the same pattern as the original scent. Such an approach requires 1) a 
method to generate a representation of the original scent in olfactory space, and 2) suitable 
receptor primary scent component chemicals which can be mixed in the appropriate manner. 

Identification of receptor scent components can be performed by various methods. One 
such method assays the interaction of candidate components with each olfactory receptor. The 
receptors can be expressed in vitro and assays can be set up to monitor the interaction of 
various candidate components with each individual receptor. Chemicals which interact with 
one and only one olfactory receptor are receptor primary scent components, while chemicals 
which interact with more than one olfactory receptor are receptor complex scent components 
(and can possibly be receptor quasi-primary scent components, depending on the interaction 
profile it displays with the olfactory receptors). Such an approach can use methods known in 
the art, for example those of Breer et al, Ann. N. Y. Acad. Sci. (1998) 855:175-81 or Malnic et 
ai, Cell (1 999) 96(5):713-23. Breer et al. expressed olfactory receptors in Sf9 cells and 
evaluated the second-messenger response to various odorants. Malnic et al. isolated olfactory 
neurons from mice and utilized calcium imaging to study the response of the neurons to 
different odorants, while using RT-PCR to determine which olfactory receptor was expressed 
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in the neuron under study. U.S. Patent No. 5,798,275 describes a method for evaluating 
interaction of compounds with members of a reference panel of proteins. WO 98/50081 
discloses methods for detecting particular odorant ligand specificity for particular odorant 
receptors in nasal epithelium tissue of mammals such as rats and mice. 

Selection of Receptor Primary See *" hv in silico Methods 

An alternative method utilizes in silico screening techniques-that is, computer 
simulation methods-for selecting candidate components. Protein-ligand screening can be used 
to select compounds which bind to particular receptors in order to identify receptor pnmary 
scent components. Examples of such programs are DOCK, AutoDock, GOLD, FlexX, LUDI, 
GROWMOL, and HOOK. (See Wang, J., Kollman, P.A., Kuntz I.D., "Flexible ligand 
docking: a multistep strategy approach," Proteins 36(1):1-19 (1999) and references therem.) 
These programs function by taking a protein structure and either matching compounds of 
known structure to the protein structure to determine the protein-ligand interaction, or by 
"growing" a molecule in the active site or binding site of a protein to determine what molecule 

will best interact with the protein. 

Olfactory receptor proteins are membrane proteins, and experimental determination of 
the three-dimensional structures of membrane proteins has lagged the corresponding structural 
determination of water-soluble proteins for various reasons. However, alternative methods for 
constructing the three-dimensional structures of proteins are available. The primary (ammo 
acid) sequences of many olfactory receptors are known. This information can be used to model 
a three-dimensional structure of a receptor protein using various algorithms and computer 
programs known in the art. The resulting model structure can then be used as the bas.s for 
evaluating interaction of candidate components with the receptor. 

Alternatively, given known chemical structures which give rise to a particular odor, 
analysis of the structures can indicate the particular portion of the chemical structure which is 
responsible for the odor. This is analogous to "pharmacore analysis" used in medicnal 
chemistry to determine the important portion of drugs. 

Methods for developing compounds which bind to receptors and other proteins of 
known structure, and determining interactions between ligands and receptors, are described in 
various references The DOCK program evaluates the fit of a ligand into a protein molecule of 
known structure (see Gschwend, D.A., Good, A.C. and Kuntz, I.D., "Molecular Docking 
Towards Drug Discovery",/ Mol Recognition 9, 175-86 (1996); Kuntz, I.D., Meng, E.C., and 
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B.K. Shoichet, "Structure-Based Strategies For Drug Design and Discovery", Acc. Chem. Res. 
27, 117-123 (1994); and Kuntz, I.D., "Structure-based strategies for drug design and 
discovery", Science 257, 1078-1082 (1992); see also 

http://www.cmpharm.ucsf. edu/kuntz/dock.html). Using a known (or modeled) structure of an 
olfactory receptor, DOCK can be used to screen for compounds which bind to the receptor. 
The program AMBER (see Cornell, WD, Cieplak P, Bayly CI, Gould IR, Merz KM Jr, 
Ferguson DM, Spellmeyer DC, Fox T, Caldwell JW and Kollman PA. "A second generation 
force field for the simulation of proteins and nucleic acids," Journal of the American Chemical 
Society 117, 5179-5197 (1995); Computer Simulation of Biomolecular Systems, A. Wilkinson, 
P. Weiner, W. Van Gunsteren, eds. Volume 3, p. 83-96, P. Kollman, R. Dixon, W. Cornell, T. 
Fox, C. Chipot and A. Pohorille; Bayly CI, Cieplak P, Cornell WD and Kollman PA. "A well- 
behaved electrostatic potential based method using charge restraints for deriving atomic 
charges - the RESP model," Journal of Physical Chemistry 97(40), 10269-10280 (1993); 
Cornell WD, Cieplak P, Bayly CI and Kollman PA. "Application of RESP charges to calculate 
conformational energies, hydrogen bond energies, and free energies of solvation," Journal of 
the American Chemical Society 1 1 5(21), 9620-963 1 (1 993); see also 
http://www.amber.ucsf.edu/amber/amber.html) can be used to calculate more precise 
interaction energies between candidate ligands. Other examples of such methods are described 
in, for example, U.S. Patent No. 5,866,343, directed to determining the energetically favorable 
binding site between two molecules; U.S. Patent No. 5,854,992, a system and method for 
structure-based drug design which takes into account binding free energy as it "grows" 
candidate molecules into a receptor binding site; and U.S. Patent No. 5,495,423, which 
describes a method for ligand design (principally applicable to peptidic ligands). 

The foregoing methods typically depend on a known three-dimensional structure for the 
receptor. When such a structure cannot or has not been determined experimentally, a structure 
can be modeled using computer algorithms. Blundell TL, Sibanda BL, Sternberg MJ, Thornton 
JM, "Knowledge-based prediction of protein structures and the design of novel molecules," 
Nature 326(61 1 1):347-52 (1987); Shortle D, "Structure prediction: The state of the art," Curr 
Biol 9(6):R205-9 (1999), Morea V, Leplae R, Tramontane A, "Protein structure prediction and 
design," BiotechnolAnnu Rev 4:177-214 (1998) and Onuchic JN, Luthey-Schulten Z, Wolynes 
PG, "Theory of protein folding: the energy landscape perspective," Annu Rev Phys Chem 
48:545-600 (1997) address various methods of predicting protein structure from sequence data. 
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Various implementations for predicting protein structure from amino acid sequences are 
discussed in U.S. Patent Nos. 5,878,373 and 5,884,230. 

If the structure, or even the identity, of the targeted receptor cannot be determined, 
alternative computational techniques can be used to generate information regarding possible 
ligands which will interact with the receptor. Quantitative structure-activity relationsh.ps 
(QSAR; see Green, S.M. and Marshall, G.R., "3-D QSAR: A current perspective," Trends 
Pharmacol Sci 16:285 (1995); and 3D QSAR in Drug Design: Theory, Methods and 
Applications, Kubinyi, H. Ed, Escom, Leiden.), including QSAR refinements such as 
comparative molecular field analysis (ComFA) (Cramer, R. D. et al. "Comparative Molecular 
Field Analysis ComFA 1 . Effect Of Shape On Binding Of Steroids To Carrier Proteins," J. Am. 
Chem. Soc. 1 10: 5959 (1 988)); and pharmacophore mapping (Martin YC, Bures MG, Danaher 
EA DeLazzer J, Lico I, Pavlik PA, "A fast new approach to pharmacophore mapping and its 
application to dopaminergic and benzodiazepine agonists," J Comput Aided Mol Des 7(1):83- 
1 02 (1 993)) have been used to design pharmacophores that can interact with the receptor. U.S. 
Patent No. 5,699,268 provides a method for producing computer-simulated receptors which 
functionally mimic biological receptors; the simulated receptors are essentially abstractions of 
structurally useful information from compounds which are known to interact with a receptor. 
U S Patent No. 5,901 ,069 describes a method of automatically refining a set of chem.cals 
using structure/activity data. U.S. Patent No. 5,862,514 describes a method of simulating 
synthesis of compounds of desired biological activity and evaluating their activity via further 
simulations. 

Application of structure-function relationships to classification of odors has been 
described by Chastrette M., Rallet E. "Structure-minty odour relationships: Suggestion of an 
interaction pattern," Flavour and Fragrance Journal, 13(1):5-18 (1998); Chastrette M., De 
Saint Laumer J.Y.,; Peyraud J.F., "Adapting the structure of a neural network to extract 
chemical information. Application to structure-odour relationships," SAR QSAR Environ Res 1 
(2-3) 221-231 (1993), Chastrette M., "Trends in structure-odor relationships," SAR QSAR 
Environ Res 6(3-4):21 5-254 (1997) and Jain et al., "A shape-based machine learning tool for 
drug design," J Comput Aided Mol Des 8(6):635-652 (1994). These methods can be useful in 
determining the "chemical distance" between odors. For example, isoamyl acetate is typically 
experienced as a banana-like odor, while octyl acetate is typically experienced as an orange- 
,ike odor, which gives a measure of how the chain length of the a.koxy portion of the ester 
influences perception. 
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Olfactory Receptors and Libraries of Olfactory Receptors 

The olfactory receptors of the invention can be used to analyze and describe the 

interaction of scent odorant molecules with each receptor. This can be done individually, 
5 receptor-by-receptor and odorant molecule by odorant molecule. However, a combinatorial 

approach provides a much more powerful method of analyzing and describing the 

interaction of scent odorant molecules with olfactory receptors. 

In one embodiment, the invention comprises libraries of olfactory receptors. These 

libraries are used to screen compositions for interaction with receptors. A composition can 
10 be a single compound (essentially a pure chemical), or a mixture of two or more 

compounds or chemicals. The compositions can be presented to the library in vapor form, 

or in solutions, typically aqueous solutions. 

The method for determining the binding pattern of a composition with olfactory 

receptors comprises the steps of: exposing the composition to an olfactory receptor library; 
1 5 and determining whether the composition binds to each olfactory receptor of the library, 

thereby determining the overall binding patter of the composition. While it is desirable to 

determine whether the composition binds to each of the olfactory receptors, in certain 

cases, determining the binding pattern to a subset of the receptors is suitable. Such a 

situation can arise if the complete pattern is not needed, or if the experiment cannot 
20 determine binding to a receptor for a particular reason. (Determining the binding to a 

subset is equivalent to reducing the olfactory receptor library to that subset of receptors.) 
Typically, the libraries are prepared as arrays, where the position of each olfactory 

receptor is known on the array. The arrays can take the form of multiwell plates, solid 

substrates such as chips or wafers, or any other form allowing identification of the receptor 
25 location. The arrays can be prepared in order to simply assess binding, or can be prepared 

in order to assess degree of activation as described above, using, for example, the technique 

of Malnic et al, Cell 1 999 96, 71 3-723. Alternatively, an in silico array of structures can be 

prepared, using the known primary structure of the receptors and the modeling techniques 

described above. 

30 The libraries contain at least two olfactory receptors. In increasing order of 

preference, the libraries contain at least 5, 10, 20, 30, 40, 50, 75, 100, 200, 300, 400, 500, 
600, 700, 800, 900, 1000, 1200, 1400, 1500, 1600, 1800, or 2000 olfactory receptors. The 
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receptors are presented as ordered arrays. For example, a96-well plate can contain 96 
receptor preparations. Upon exposure to a composition, the plate can be seined, and me 
response of each receptor in each well can be eva,ua,ed. This leads to a 96-e.emen, vector 
description of the composition in terms of those 96 olfactory receptors. 

5 In one embodiment, binding to the olfactory receptors is assessed. In another 

embodiment, the approximate binding constant of the composition to me olfactory 
receptors is determined. In yet another embodtment, the degree of activation of tire 
olfactory receptor by tire composition is determined. For receptor antagonists, bindtng wrll 
occur, but no activation will occur; the invention embraces the identification of such 

10 antagonists. 

The compositions for use are varied. A set of all volatile compounds can be used. 
A standard set of perfumes or odorants can be used. A set of commercially used scents can 
be used. Sets of compounds particularly useful in the invention are disclosed in co-pendmg 
United States Patent Application Serial No. 09/620,753. However, it must be emptied 
15 that the invention is not limited to any one set or classification of compounds. 

Preferred subsets of olfactory receptor polynucleotide sequences include: 
SEQ ID NOS: 163, 331, 414, 425,672, 762, 919, and 1027; 
SEQ ID NOS: 809 and 1067; 
SEQ ID NO: 744; 
20 SEQ ID NOS: 207, 336, 441, and 615; 

SEQ ID NOS: 157, 168, 197, 221, 250, 334, 340, 412, 413, 459, 491, 618, 690, 
694, 759, 760, 761 , 767, 819, 860, 872, 873, 917, 936, 939, 940, 947,952, 958, 959, 1023, 

1034, 1038, 1043, and 1044; 

SEQ ID NOS: 783, 785, 882, 888, 922, and 925; 
25 SEQ ID NOS: 707, 748, 752, 755, 756, 790, and 997; 

SEQ ID NOS: 1065, 1066, 1067, 1068, 1069, 1070, 1071, 1072, 1073, 1074, 1075, 
1076 1077, 1078, 1079, 1080, 1081, 1082, 1083, and 1084; 

' SEQ ID NOS: 163, 239, 331, 335, 368, 381, 385, 414, 425, 514, 572, 596, 603, 
628, 638, 642, 672,674, 689, 744, 762, 809, 835, 885, 896, 919, 920, 938, 948, 972, 999, 

30 1007, 1014, and 1027; 

SEQ ID NOS: 164, 173, 176, 180, 182, 184, 185, 188, 190, 194,207,210,213,214, 
215, 217, 219, 220, 223, 226, 227, 229, 230, 234, 235, 240, 249, 255, 265, 270, 273, 274, 
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276 277 279 281,289,291,293,294,298,302,307,311,318,319,321,330,336,339, 
34 ,' 342 343 348, 351, 356, 359, 361, 365, 366, 367, 368, 370, 372, 373, 374, 375, 376, 
378 379 380, 382, 383, 384, 385, 388, 391, 392, 393, 398, 400, 401, 403, 408, 420, 423, 
427 428 431, 434, 435, 438, 439, 440, 441, 447, 448, 450, 455, 458, 464, 465, 468, 471, 
5 473' 474 475, 478, 479, 481 , 482, 484, 485, 492, 494, 499, 502, 508, 511, 512, 513, 515, 
526 532, 534, 541, 543, 545, 546, 550, 552, 553, 557, 558, 560, 563, 564, 568, 572, 576, 
582 583, 584, 585, 586, 588, 599, 600, 605, 606, 607, 608, 609, 610, 615, 620, 621, 631, 
632 636, 638, 640, 642, 645, 648, 650, 651, 652, 654, 656, 657, 661, 662, 664, 668, 679, 
680 686, 687, 689, 691, 696, 699, 700, 702, 706, 713, 720, 721, 723, 729, 734, 738, 745, 
10 768 772, 773, 775, 791, 798, 799, 823, 857, 898, 900,901, 903, 914,931, 933, 937,941, 
945, 948, 956, 965, 969, 983, 992, 993, 994, 999, 1003, 1005, .009, 1010, 101 1, 1019, 
1028, 1035, 1037, 1052, 1061, 1062, and 1063 

SEQ ID NOS: 157, 161, 163, 168, 197, 200, 205, 218, 221, 242, 250, 331, 334, 
340 412 4,3,414,419,425,452,453,454,456,459,462,491,591,618,622,663,665, 

,5 667 670, 672, 690, 694, 695, 709, 759, 760, 761, 762, 767, 819, 820,822, 826, 832, 846, 
847 860, 872, 873, 877, 881, 887, 908, 9! 1, 913, 917, 919, 921, 936, 939, 940, 942, 944, 
947',951,952,955,958, 959,960, 964,975, 977, 979,986, 1023, 1027, .034, .038, 1043, 

1044, 1049, and 1051; 

SEQ ID NOS: 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 164, 165, 166, 
20 167, 168, 169, 170, 171, 172, .73, 174, 175, .76, 177, 178, 179, 180, 181, .82, 183, .84, 
,85 186, 187, 188, 189, ,90, .91, .92, ,93, 194, 195, ,96, 197, 198, .99,200,201,202, 
203' 204, 205, 206, 207, 208,209, 210, 21 1, 2.2, 2.3, 2,4, 215, 2,6, 217, 2,8, 219, 220, 
22l' 222 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 
240 241 242, 243, 244, 245, 246, 247, 248, 249, 250, 251 , 252, 253, 254, 255, 256, 257, 
25 258 259, 260, 261, 262, 263, 264, 265, 266, 267, 268, 269, 270, 27,, 272, 273, 274, 275, 
276 277 278,279,280,28,,282,283,284,285,286,287,288,289,290,29,,292,293, 
294 295 296 297, 298, 299, 300, 301, 302, 303, 304, 305, 306, 307, 308, 309, 310, 31 1, 
3,2 3,/ 3,4, 315, 316, 317, 318, 319, 320, 32,,322, 323, 324, 325, 326, 327, 328, 329, 
330 332 333, 334, 336, 337, 338, 339, 340, 341, 342, 343, 344, 345, 346, 347, 348, 349, 
30 35o', 35,, 352, 353, 354, 355, 356, 357, 358, 359, 360, 361, 362, 363, 364, 365, 366, 367, 
369 370, 371, 372, 373, 374, 375, 376, 377, 378, 379, 380, 382, 383, 384, 386, 387, 388, 
389, 390, 391, 392, 393, 394, 395, 396, 397, 398, 399, 400, 401, 402, 403, 404, 405, 406, 
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407, 408, 409, 410, 411,412, 413, 415, 416, 417, 418, 419, 420, 421, 422, 423, 424, 426, 
427, 428, 429 ? , 430, 431, 432, 433, 434, 435, 436, 437, 438, 439, 440, 441, 442, 443, 444, 
445, 446, 447, 448, 449, 450, 451, 452, 453, 454, 455, 456, 457, 458, 459, 460, 461, 462, 
463, 464, 465, 466, 467, 468, 469, 470, 471, 472, 473, 474, 475, 476, 477, 478, 479, 480, 
5 481, 482, 483, 484, 485, 486, 487, 488, 489, 490, 491, 492, 493, 494, 495, 496, 497, 498, 
499, 500, 501, 502, 503, 504, 505, 506, 507, 508, 509, 510, 511, 512, 513, 515, 516, 517, 
Sis', 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 
536, 537, 538, 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 
554, 555, 556, 557, 558, 559, 560, 561, 562, 563, 564, 565, 566, 567, 568, 569, 570, 571, 
10 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 
59l', 592, 593, 594, 595, 597, 598, 599, 600, 601, 602, 604, 605, 606, 607, 608, 609, 610, 
61 1! 612, 613, 614, 615, 616, 617, 618, 619, 620, 621, 622, 623, 624, 625, 626, 627, 629, 
630, 631, 632, 633, 634, 635, 636, 637, 639, 640, 641, 643, 644, 645, 646, 647, 648, 649, 
650, 651, 652, 653, 654, 655, 656, 657, 658, 659, 660, 661, 662, 663, 664, 665, 666, 667, 
15 668, 669, 670, 671, 673, 675, 676, 677, 678, 679, 680, 681, 682, 683, 684, 685, 686, 687, 
688, 690, 691, 692, 693, 694, 695, 696, 697, 698, 699, 700, 701, 702, 703, 704, 705, 706, 
707, 708, 709, 710, 711, 712, 713, 714, 715, 716, 717, 718, 719, 720, 721, 722, 723, 724, 
725, 726, 727, 728, 729, 730, 731, 732, 733, 734, 735, 736, 737, 738, 739, 740, 741, 742, 
743, 745, 746, 747, 748, 749, 750, 751, 752, 753, 754, 755, 756, 757, 758, 759, 760, 761, 
20 76 3 ! 764', 765. 766, 767, 768, 769, 770, 771, 772, 773, 774, 775, 776, 777, 778, 779, 780, 
781, 782, 783, 784, 785, 786, 787, 788, 789, 790, 791, 792, 793, 794, 795, 796, 797, 798, 
799, 800, 801, 802, 803, 804, 805, 806, 807, 808, 810, 811, 812, 813, 814, 815, 816, 817, 
818, 819, 820, 821, 822, 823, 824, 825, 826, 827, 828, 829, 830, 831, 832, 833, 834, 836, 
837, 838, 839, 840, 841, 842, 843, 844, 845, 846, 847, 848, 849, 850, 851, 852, 853, 854, 
25 855! 856, 857, 858, 859, 860, 861, 862, 863, 864, 865, 866, 867, 868, 869, 870, 871, 872, 
873, 874, 875, 876, 877, 878, 879, 880, 881, 882, 883, 884, 886, 887, 888, 889, 890, 891, 
892, 893, 894, 895, 897, 898, 899, 900, 901, 902, 903, 904, 905, 906, 907, 908, 909, 910, 
91 \, 912, 913, 914, 915, 916, 917, 918, 921, 922, 923, 924, 925, 926, 927, 928, 929, 930, 
93l', 932, 933, 934, 935, 936, 937, 939, 940, 941, 942, 943, 944, 945, 946, 947, 949, 950, 
30 95l', 952, 953, 954, 955, 956, 957, 958, 959, 960, 961, 962, 963, 964, 965, 966, 967, 968, 
969, 97o! 971, 973, 974, 975, 976, 977, 978, 979, 980, 981, 982, 983, 984, 985, 986, 987, 
988, 989, 990, 991, 992, 993, 994, 995, 996, 997, 998, 1000, 1001, 1002, 1003, 1004, 1005, 
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1006, 1008, 1009, 1010, 1011, 1012, 1013, 1015, 1016, 1017, 1018, 1019, 1020, 1021, 
1022, 1023, 1024, 1025, 1026, 1028, 1029, 1030, 1031, 1032, 1033, 1034, 1035, 1036, 
1037, 1038, 1039, 1040, 1041, 1042, 1043, 1044, 1045, 1046, 1047, 1048, 1049, 1050, 
1051, 1052', 1053, 1054, 1055, 1056, 1057, 1058, 1059, 1060, 1061, 1062, 1063, and 1064; 

5 and any and all combinations of the foregoing sets. 

The polypeptide translation products of those polynucleotide sequences form sets of 
preferred olfactory receptor polypeptides, as well as any and all combinations of those 
polypeptide sets. The preferred sets of polypeptide translation products, and any and all 
combinations thereof, are also preferred sets for use as libraries of olfactory receptors for 

10 scent analysis. 



Scent Fingerprinting 

It will be appreciated that in many instances, analysis of a scent (whether in terms of 
15 receptor primary scent components, receptor quasi-primary scent components, receptor 

complex scent components, or other scent representations) is of great utility in and of itself, in 
addition to the utility of that analysis in scent re-creation. Thus, another embodiment of the 
invention encompasses "scent fingerprinting," which comprises analysis of a scent profile when 
re-creation of that scent may not be necessary or desirable. The distinction between scent 
20 profiling, as defined above, and scent fingerprinting, as defined here, is that scent profiling is a 
representation of a scent relative to a mammalian olfactory system in such a manner as to 
provide useful information about the interaction of the scent with that olfactory system, such as 
sufficient information to enable re-creation of the scent from receptor primary scent 
components. In contrast, scent fingerprinting can, but does not necessarily, provide such 
25 information. 

Various applications and examples of scent fingerprinting can include, but are not 
limited to, the following illustrative situations. Natural gas is widely used as a heating and fuel 
supply, but is in itself odorless. Utility companies routinely add small amounts of odorants 
such as mercaptans to allow detection of natural gas leaks in households. Should a leak occur 
30 at an unattended site, however, potentially dangerous quantities of natural gas can accumulate. 
In such areas, a device which can recognize odorants would be useful. 

Another use of scent fingerprinting is quality control of a manufacturing process. 
Many food items, such as freshly-baked bread and pastries, sauces, and cheeses, have distinct 
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odors. A manufacturer can record a scent fingerprint for a given food item, e.g. spaghetti sauce 
for packaging in jars. The quality of the product can then be monitored at various stages in 
manufacture and storage, and deviations from the established scent fingerprint can be used to 
alert the manufacturer to problems in manufacture or storage. Quality control scent fingerprints 
are not limited to food items, but can be used in any circumstance where a volatile component 
of an item of manufacture can be used as a quality control indicator, e.g., perfume, deodorants, 

solvent mixtures, etc. 

While scent fingerprints need not be meaningful in terms of a mammalian olfactory 
system, it will be readily appreciated that a scent profile, which does represent a scent in a 
manner relevant to an olfactory system, is a special type of scent fingerprint. Additionally, the 
response of a device which yields a scent fingerprint of an odor (such as the "artificial nose" 
described in U.S. Pat. Nos. 5,571,401, 5,698,089, 5,788,833, 5,891,398 and 5,91 1,872) can be 
calibrated against the response of a mammalian olfactory system in order to transform the scent 
fingerprint generated by the device into a true scent profile which can be utilized to re-create an 
odor using receptor primary scent components, receptor quasi-primary scent components, or 
receptor complex scent components. The invention encompasses such data transformations. 

Scent Editing 

Representation of a scent as a scent profile provides the capability of editing the scent. 
A scent profile which represents a scent in terms of perceptive primary scent components is the 
most straightforward representation to edit. An example is the perceptive complex primary 
scent of "burned pizza" comprised of perceptive primary scent components of sausage, cheese, 
tomato sauce, and burned dough. In order to edit the scent to provide a more pleasant re- 
creation, the perceptive primary scent component of burned dough would simply be eliminated. 

Other scent profiles can be edited using a knowledge of the perception of a particular 
components. Using our six-receptor example, suppose that the (1, 0, 0, 0, 1, 0) receptor 
complex scent component is known to provide an unpleasant aspect of the scent, while the 
(0, 1, 1, 1, 0, 1) component is known to provide the pleasant aspect of the scent. The first 
complex scent component can be omitted from the edited scent profile, leaving (0, 1, 1, 1, 0, 1) 
as the edited scent profile. (This would also alter the index values for scent re-creation, from 1 
and 1, to 0 and 1.) More complex editing situations can be manipulated using computer 
algorithms as discussed above. 
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Individual scent components can be omitted, added, weakened, or intensified, and 
different scent components can be adjusted in different manners or degrees, depending on the 
desired result. The editing can be done interactively, with each edited scent emitted by the 
emitter module for evaluation by the user, or can be done automatically, with 
removal/weakening or addition/intensifying of particular components specified in advance, on 
either an absolute scale or relative to other components. 

The following examples are presented to illustrate, but not to limit, the invention. 

EXAMPLES 

Example 1: Isolation of human olfactory receptor cDNAs 

Total RNA was extracted from human olfactory epithelium and polyA + RNA was 
obtained by oligo-dT selection. This RNA served as template for cDNA synthesis using 
reagents from the SMART cDNA Library construction kit (Clontech Kl 05 1-1; Palo Alto, 
CA). The Superscript II™ reverse transcriptase (Life Technologies, Gaithersburg, MD) 
was used for first-strand synthesis. 

Double-stranded cDNA was passed through a Chroma-Spin + STE-100 column 
(Clontech) to remove unreacted primers and cDNA fragments shorter that 100 nucleotides. 
The olfactory epithelial cDNA population was then subjected to amplification using 
primers homologous to conserved regions in GPCRs. The first primer set was homologous 
to transmembrane segment 2 (TM2) and the second set was homologous to TM 7.5. The 
TM2 primer set contained 32 oligonucleotides, representing all possible nucleotide 
sequences capable of encoding the TM2 amino acid sequence motif P-M-Y-F/L-F/Y-F/L, 
and designed to be non-degenerate at their 3' ends. Sequences of the TM2 primers are as 



follows: 



CCN ATG TAY TTN CTC CTA SEQ D NO: 74 

CCN ATG TAY TTN CTC CTC SEQ D NO: 75 

CCN ATG TAY TTN CTC CTG SEQ D NO: 76 

CCN ATG TAY TTN CTC CTT SEQ D NO: 77 

CCN ATG TAY TTN CTC TTA SEQ D NO: 78 

CCN ATG TAY TTN CTC TTC SEQ D NO: 79 

CCN ATG TAY TTN CTC TTG SEQ D NO. 80 

CCN ATG TAY TTN CTC TTT SEQ D NO 81 

CCN ATG TAY TTN CTT CTA SEQ D NO. 82 

CCN ATG TAY TTN CTT CTC SEQ D NO: 83 

CCN ATG TAY TTN CTT CTG SEQ ID NO: 84 
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CCN ATG TAY TTN CTT CTT SEQ D NO. 85 

CCN ATG TAY TTN CTT TTA SEQ D NO. 86 

CCN ATG TAY TTN CTT TTC SEQ D NO: 87 

CCN ATG TAY TTN CTT TTG SEQ D NO 88 

5 CCN ATG TAY TTN CTT TTT DNO 

CCN ATG TAY TTN TTC CTA SEQ D NO. 90 

CCN ATG TAY TTN TTC CTC SEQ D NO: 91 

CCN ATG TAY TTN TTC CTG SEQ D NO: 92 

CCN ATG TAY TTN TTC CTT SEQ ID NO: 93 

10 CCN ATG TAY TTN TTC TTA c^ nNO-95 

CCN ATG TAY TTN TTC TTC SEQ ID NO. 95 

CCN ATG TAY TTN TTC TTG SEQ ID NO: 96 

CCN ATG TAY TTN TTC TTT SEQ ID NO: 97 

CCN ATG TAY TTN TTT CTA SEQ ID NO: 98 

15 CCN ATG TAY TTN TTT CTC SEQ ID NO: 99 

CCN ATG TAY TTN TTT CTG SEQ ID NO: 100 

CCN ATG TAY TTN TTT CTT SEQ ID NO: 1 01 

CCN ATG TAY TTN TTT TTA SEQ ID NO: 1 02 

CCN ATG TAY TTN TTT TTC SEQ ID NO: 1 03 

20 CCN ATG TAY TTN TTT TTG 05 

CCN ATG TAY TTN TTT TTT SEQ ID NO. 1 05 

The TM7.5 primer set was designed to contain the reverse complement of all 
sequences capable of encoding the TM7.5 amino acid sequence motif P-F/L/I/V-I/V-F/Y- 
25 S/T-L. The sequences of the TM7.5 primers are as follows: 

YYTNGTNYTNRYNCYGATANATNATNGGRTT SEQ ID NO: 1 06 
YTRTTNCKNAGNWRTANATRAANGGRTT SEQ ID NO: 107 

TCYTTRTTNCKNAGNGWRTANAYNASNGGRTT SEQ ID NO: 108 
30 TCNTSRTTNCKNARNSARTANATNATNGGRTT SEQ ID NO: 109 
RTTNCKNARNSWRTANATRAANGGRTT SEQ ID NO: 1 10 

Reagents and enzymes for amplification were from the Advantage cDNA 
amplification kit (Clontech). A primary amplification reaction was constructed as follows: 
35 5 nl olfactory epithelial cDNA (1 0-20 pg/ml) 

5 pi 10X PCR reaction buffer (Clontech) 
1 pi TM2 primer set (10 uM) 
1 pi TM7.5 primer set (10 uM) 

1 pi dNTP mix (10 mM each dATP, dCTP, dGTP, dTTP) 
40 36 pi PCR-grade H 2 0 

1 pi Advantage polymerase mix (Clontech) 
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Amplification was conducted in a PE 480 thermal cycler, using 28 cycles of 95°C 
for 15 sec, 45°C for 45 sec and 72°C for 2 min. After cycling, the amplification mixture 
was treated for 1 hour at 37°C with 10 Units of BspEI and 10 Units of Pstl restriction 
enzymes, to degrade non-specific amplification products. 

The primary amplification products were size-fractionated by agarose gel 
electrophoresis, and amplification products having a length between 600 and 800 base pairs 
were selected for secondary amplification. 

The secondary amplification reaction was conducted identically to the primary 
amplification reaction, except that the size-selected primary amplification product was used 
as template. Secondary amplification reactions containing products which generated a 
specific gel band of between 600 and 800 base pairs were extracted once with 
phenol/chloroform and once with chloroform, and nucleic acids were precipitated from the 
reactions by addition of 0. 1 volume of 3M NaOAc (pH 4.8), 20 ug glycogen, and 1 .5 
volumes of cold 95% ethanol. The precipitate was collected by centrifugation, dried and 
resuspended in 15 ul distilled water. After the precipitate dissolved, 3 ul loading dye was 
added, and the sample was subjected to electrophoresis on a 1 .0% low-melting agarose gel 
containing ethidium bromide. Electrophoresis was conducted at 60V for approximately 
40 min, with a 1 kb marker in adjoining lanes. 

Following electrophoresis, the gel was illuminated with long-wavelength ultraviolet 
light, and the band was excised from the gel. The gel slice was placed in a 0.5 ml tube, and 
the tube was heated at 68°C for 1 5 min. The temperature of the tube was then equilibrated 
at 45°C. (This is conveniently accomplished in a thermal cycler.) AgarACE™ (Promega) 
was then added to the tubes, according to the manufacturer's instructions, and incubation at 
45°C was continued for 15 min. As a general rule, 2 ul of enzyme per 50 ul of gel slice is 
adequate. Following AgarACE™ digestion, the digestion mixture was extracted with 
phenol/chloroform according to the manufacturer's instructions, and nucleic acids were 
precipitated by addition of 0.1 volume of 3M NaOAc (pH 4.8), 20 ug glycogen, and 1 .5 
volumes of cold 95% ethanol. The precipitate was collected by centrifugation, dried and 
resuspended in 5 ul distilled water. 

Gel-purified amplification products were cloned using the TOPO XL PCR Cloning 
Kit (Invitrogen) according to the manufacturer's instructions. After cloning, individual 
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colonies were selected at random for nucleotide sequence analysis of the inserts, using 
procedures for sequence determination that are well-known to those of skill in the art. 

Example 2: Use of olfactory receptor polypeptides for screening 

Components of a scent are identified by determining the interaction between one or 
more potential odorant molecules and one or more OR polypeptides. For example, if a 
known original scent involves binding to a particular set of ORs, any subsequent set of 
molecules which bind to that same set of ORs and stimulate or inhibit the response of the 
ORs to the same extent as the original scent is capable of re-creating that original scent. If 
each of the subsequent set of molecules interacts with one and only one OR, then the set of 
molecules is composed of receptor primary scent components. In similar fashion, scents 
which involve binding of multiple ORs can be recreated by identifying a molecule, or 
combination of molecules, which binds to that particular set of ORs. 

Binding of molecules to ORs is determined by a number of methods that are well- 
known in the art including, but not limited to, in vitro and in silico methods as described 
herein. Binding of molecules to ORs can also be determined or approximated by using 
quantitative structure-activity relationships as described herein. 

Example 3: Identification of agonists and antagonists of olfactory receptors 

Interaction of an odorant with a particular OR embedded in the membrane of an 
olfactory neuron will activate a signaling cascade within the neuron, ultimately resulting in 
the perception of a particular smell. A molecule, produced for example by combinatorial 
chemistry, which activates a similar or identical signaling cascade, will induce the 
perception of the same smell. Such a molecule would be considered a OR agonist. An OR 
agonist, once identified, can be used as a probe to identify additional agonists, as well as 
antagonists, of that particular OR. 

Assays for the activation and the end product(s) of signaling cascades are known in 
the art. For example, direct Ca^ imaging can be employed, using either dye -labeled Ca^ 
or dyes that are sensitive to Ca~ concentration. Such dyes, and techniques for their use, 
are available from, for example, Molecular Dynamics (Sunnyvale, CA) and Molecular 
Probes (Eugene, OR). 
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Because ORs are transmembrane proteins, identification of agonists and/or 
antagonists for a particular OR require that the OR is present either in a living cell or in a 
membrane preparation. 

In one embodiment of a method for the determination of OR agonists or 
5 antagonists, a known OR agonist is labeled in situ, or is resynthesized with an attached 
label, and is bound to an OR. The effect of various test molecules on the binding of the 
labeled OR agonist is then determined. Labeling of an OR agonist is accomplished by any 
of a number of methods that are known to those of skill in the art including, but not limited 
to, various fluorescent labels (for example, chemical fluorochromes or green fluorescent 

1 0 protein). Binding of the OR agonist is measured by any of a number of competitive 

binding assays, as are known in the art. A test molecule that displaces the agonist from the 
OR {i.e., reduces the binding of the agonist) is identified as a candidate agonist or 
antagonist of the particular OR. In a subsequent experiment, the candidate molecule is 
bound to the OR, and the effect on the signaling cascade induced by the original agonist is 

1 5 determined. A similar of higher level of activation is indicative of an agonist; while a 
reduced level of activation of the signaling cascade reflects the action of an antagonist. 

In additional embodiments of the displacement assay, an unlabeled agonist is used, 
and its degree of binding is determined by mass spectrometry. See, for example, U.S. 
Patent No. 5,894,063; U.S. Patent No. 5,719,060; and Wei et al (1999) Nature 399:243- 

20 246. 

In another embodiment, fluorescent microparticles ("beads"), which can be 
separated by flow cytometry, are used to identify OR agonists and antagonists. Such beads 
are available, for example, from Luminex (Austin, TX). Multiple different ORs are 
attached to the beads, wherein each distinct color of bead is associated with a particular 

25 OR. The collection of beads, containing different ORs, is exposed to a test molecule or a 
collection of test molecules, such as can be synthesized by combinatorial chemistry, and 
binding of the test molecule(s) is determined, for example, by use of a labeled ligand of the 
test molecule(s). The beads are sorted according to their color by flow cytometry. 
Correlation of test molecule binding with bead color allows the determination of test 

30 molecules capable of binding to the OR. Agonist or antagonist function of an OR binding 
molecule is determined by methods described supra. 
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Example 4: Summary of search parameters for homology searches 

Step 1 : (masking) rempolyatmask raw sequence on -NONE- [?] with remAT_moderate 
(15). Continue to step 2. 

Step 2: (masking) mask masked sequence from step 1 on RepBase [N] with 
5 mask_moderate (85) . Continue to step 3. 

Step 3: (masking) mask masked sequence from step 2 on VecBase [N] with 
mask_moderate (85) . Continue to step 4. 

Step 4: blastn masked sequence from step 3 on NR-Nuc [N] with blastn_10_hits (V=10 
B=10) . If the P/Z score is > 1 .0E-50, or no hits are found go to step 5. Otherwise, stop. 
1 0 Step 5 : blastx masked sequence from step 3 on NR-Pro [P] with blastxj 0_hits (V= 1 0 
B=10) . If the P/Z score is > 1.0E-50, or no hits are found go to step 6. Otherwise, stop. 
Step 6: blastn masked sequence from step 3 on GB_CurAwareness-Nuc [N] with 
blastn_10_hits (V=10 B=10) . If the P/Z score is > 1 .0E-50, or no hits are found go to step 

7. Otherwise, stop. 

1 5 Step 7: blastx masked sequence from step 3 on GB_CurAwareness-Pro [P] with 

blastxj 0_hits (V=10 B=10) . If the P/Z score is > 1 .OE-50, or no hits are found go to step 

8. Otherwise, stop. 

Step 8: tblastx masked sequence from step 3 on NR-Nuc [N] with tblastxj 0_hits (V=10 
B=10) . If the P/Z score is > 1. OE-50, or no hits are found go to step 9. Otherwise, stop. 
20 Step 9: blastn masked sequence from step 3 on EST [N] with blastn_10_hits (V=10 B=10) . 
If the P/Z score is > 1 .OE-50, or no hits are found go to step 1 0. Otherwise, stop. 
Step 10: blastn masked sequence from step 3 on STS [N] with blastn_10_hits (V=10 B=10) 
. Stop. 



48 



WO 01/27158 



PCT/USOO/27582 



Example 5: Summary of search results 
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Example 6. Datamining and analysis from GenBank 

Daiamining. A datamining pipeline was built to detect all available OR-like 
sequences in the public databases and to update the results as new database versions are 
released, tblastn (Altschul et al„ 1997) was used to compare amino acid query sequences 
to the non-redundant version of GenBank (partitions nt, htg and estjiuman, all updated to 
August 6th, 2000), with a non-stringent expectation value cutoff of le-4. The queries used 
included 96 curated OR sequences representing all known families (SEQ ID NO:265 1 
through SEQ ID NO:2747) and 249 additional HORDE entries (SEQ ID NO:2402 through 
SEQ ID NO:2650). In a second round 1 05 newly mined mouse genes (SEQ ID NO:2296 
1 5 through SEQ ID NO:2401 ) and 344 newly mined human genes (SEQ ID NO:2009 through 
SEQ ID NO:2295) were used as additional queries (all datasets are available 
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demonically). All resuhing da,abase en<nes were catalogued by species and subdivided 
Keyword mOS PHASE,-3,orwith,eng.ha,,eas, lOkb. Low-pass genomtc samplutg 

; sequenceugODSDsequencswasused. All se q uences used were sp. , nt oco„«gs 
adding .0 anno,a,ion or, where unavailable, according .0 runs of a. leas, 50 Ns 

ol, iW Subconrigs were defrned as segments between interspersed repeaa, rgnorrng 
simple repeats and low-complexity regions. 

*~ The University of Santa Cruz (UCSC) Worfcng 

Oraf, Sequence ("golden path", hnp./genome.ucsc.edu) presents a ft— assemb.y 
of fte fled and draft human genomic science based on the WUSTL Cone map 

ac filed or unfinished genomic Cone, in Mb from ft. P telomere. Inparalle, fire 

;- r Ti ooTes 

„ere integrated baaed on the coordina.es of Cones tha, cou,d be locahzed . *fcO«- 
for which no coordinate could be obtained by either method were asa.gned a chromosome 
^clg.oUDB.by^esimil^toanoftermappedclone.byannotarton.orby. 

20 PCR(Schuler, 1997). . fastY fPearson 

De.eOon of OR sequences. Each subconfig was compared usmg FASTY (Penrson 

e, a! .997) to a curated set of OR protein sequences from several species, 
conceptual translation product. The possibility of a pseudogene being disrupted by the 
I! of interspersed repeats was taken into account, with the two or more resuhmg 
25 "ngfteref ^re located in different subcontigs. Such compatible candidate sequenc s 

1 utomatically Joined into a combine, recons.ruc.ed pseudogene. Whenever poss.b e, 
::2gseJncesweren^edorex,endedtouseasui,ab,eATOcod„»or.m,,fion 

275 amino acids. The sequences were finally split into OR or non-OR by comparing them 
30 to previously recognized OR sequences and to a non-rednndan, data ase of non-OR 

GPCRs which we ex.ac.ed from Swiss-Pro.. To he au.oma.iea.ly Casstfied as an OR, a 
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new sequence has to be a, least 40% identical ove, a, .east 100 amino acids ,o another OR. 
A more stringent cutoff (50%) was required for shorter sequences. 

DefinMon of OR genes. A given gene could be represented in more than one 
overlapping genomic Cone. Such redundancy was removed by considering two sequences 
5 « representing the same gene, if they are in the same chromosome, ,oca,ed in clones ess 
than 300 Kb apart and a, leas, 99% identical a, the nucleotide level. An exception to tins 
nne is when two genes coappear in the same clone, in which case they were constdered to 
be distinct genes. Sequences Realized to a chromosome bu, without a coord,na,e were 
only compared ,o other sequences within that chromosome, and finally those sequences 
,0 lacking a chromosomal assignment were compared to the resu applying only tire crttenon 
of sequence similarity. For each resting gene with more than one constituent . 
eonlus nucleotide sequence was created after muuiple afignmen. by ClustalW (Htggtns 
e. a, , 1996) using tire fas, comparison parameter. This was followed by conceptual 
nation and end .rimming ,0 sui,ab,e « and stop codons, as above. Oenes wtth length 
15 a, leas, 275 ammo acids withou, fiame disruptions (frameshins, in-fiame stop codons or 
disrupting interspersed repeats) were considered to be MHengti, and apparently rntact. 
For partial sequences withou, frame disruptions no statement could be made on to 
apparent functionality, excep, when the partial sequences were observed in tire genome * 
Z, in which case they were considered ,0 be pseudogenes. Finally, each OR gene was 
20 assigned a family and subfamily by amino acid sequence similarity to previous,, classified 

0R ge "^e references ci,ed in fitis example are: Ahschul, S. F, Madden, T. L„ Schaffer 
A. A., Zhang, J-, Zhang, Z, Miller, W. and Lipman, D. J. (1997, Gapped BLAST 
BLAST- a new generation of protein database search programs. Nucleic Acds Res 25: 
25 3389-402; Chalifa-Caspi, V., Prilusky, J. and Lancet, D. (.998) The Unified Database. 

Weianann Institute of Science, Bioinformatics Unit and Genome Center (Rchovot, Israe ,. 
World Wide Web URL: b,oinformatic S .weizmann.ac.iVudb; Higgins, D. G., Thompson, L 
D and Gibson, T. J. (1996) Using CLUSTAL for multiple sequence angnmentsj Methods 
Enzymol 266: 383-402; Pearson, W. R-, Wood, T. Zhang, Z. and Miller, W W 
30 Comparison of DNA sequences with protein sequences. Genomics 46: 24-36; Schuler, O. 
D (,997) Sequence mapping by e.ectronic PCR. Genome Res 7: 541 50; and Smrt, A. F. 
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A. and Green, P. (1997) RepeatMasker at URL: repeatmasker.genome.washington.edu/cgi- 
bin/RM2_req.pl. 

Tables 1 and 2 contain additional information regarding SEQ ID NO. 153 to SEQ 
5 ID NO. 1 085. The explanation of the entries in Tables 1 and 2 is as follows: 

Symbol: The Human Genome Organization gene symbol, as allotted by a procedure 
to be published soon. OR = Olfactory Receptor, numeral to the immediate right - family 
designation, capital letters - subfamily designation, rightmost numeral - individual gene 
within subfamily, n appearing when such number is not assigned yet; P = Pseudogene. 
10 All ORs within a family share at least 40% protein sequence identity. 

All ORs within a subfamily share at least 60% protein sequence identity. 
HORDE: The H serial number within the Human Olfactory Receptor Data 
Exploratorium (URL bioinfo.weizmann.ac.il/HORDE). The numeral 38 represents the 
HORDE build (version), gxxx is the individual gene number. 
1 5 Digi: Appearance of a DSnn serial number here means that the sequence has been 

PCR-amplified from human olfactory epithelial cDNA using degenerate primers at the 
transmembrane helix 2 and transmembrane helix 7. See separate page for explanations on 

the analysis of the DS entries. 

OST: OSTnnn is the serial number of the sequence in the Olfactory Sequence Tag 
20 collection in the Lancet laboratory (URL bioinfo.weizmann.ac.il/HORDE). Appearance 
here means that the sequence has been PCR-amplified from human genomic DNA using 
degenerate primers at the transmembrane helix 2 and transmembrane helix 7. There are a 

total of 1 12 OST sequences. 

Trivial name: One or more aliases given to the same gene by different laboratories. 
25 Many of the trivial names are of the form ORnn-xx, whereby nn is a chromosome number 

and xx is an arbitrary numerical identifier. 

Tran: (transcribed) Plus appears if the entry was sequenced from cDNA, or was 

found in the Expressed Sequence Tags (EST) databases. Plus also appears if in the public 

databases the gene was annotated as mRNA. 
30 Int: (intact) "Yes" indicates that the gene may be intact, as there are no obvious 

sequence frame disruptions. "Put" (putative) indicates the same, except that the known 

sequence is short, hence there may be disruptions in the unsequenced segments. "Pol" 
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indicates a polymorphism between intact and pseudogenic alleles. When no word appears, 

this indicates a pseudogene. 

E: (Extent) FL indicates that the Full Length sequence is known (typically 310 ± 30 

amino acids). 

5 D^The number of sequence disruptions in the known sequence of a pseudogene. 

GThe human chromosomal location of the OR gene, assigned as described under 
Mb coord. 

Mb coord: The location of the OR gene within a human chromosome, in magabase 
units, beginning at the p-telomere and ending at the q-telomere, computed based on 
10 integrating information from Unified Database (URL is bioinfo.weizmann.ac.il/udb) and 
the University of California Santa Cruz (URL is genome.ucsc.edu). 

CDR: The 17 amino acids suggested to line the odorant ligand binding pocket, 
delineated by the extracellular 2/3 of transmembrane helices 3,4 and 5. The assignment is 

based on an algorithm at URL 
1 5 bioinformatics.weizman.ac.il/HORDE/humanGenes/CDR.html. 

%: (% id) The percent protein identity between the human sequence in the current 

line and the known rodent (rat or mouse) OR sequence to which it bears the highest 

similarity. 

S: (Species) Rat (R)or mouse (M). 
20 Ace: The Genbank accession number of the clone that contains the rodent sequence. 

Range: The positions x ... y of the first and last bases within the rodent which 
constitute the OR coding region. If x>y then the OR is on the reverse strand. 
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FL 


208 


ORllHnP 


H38g05 
6 












FL 


209 


OR7EnP 


H38g05 
7 








+ 






210 


OR9In 


H38g05 
8 










yes 


FL 


211 


OR2AFnP 


H38g05 
9 












FL 


212 


OR13KnP 


H38g06 
1 












FL 


213 


OR13Cn 


H38g06 
2 










yes 


FL 


214 


OR13Fn 


H38g06 
3 










yes 


FL 


215 


OR9Qn 


H38g06 
4 










yes 


FL 
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SEQ 
ID # 


Symbol 


HORDE 


Digi 


OST 


Trivial 


Tran 


Int . 


E 


216 


DR2TnP 


H38g06 
5 












FL 


217 


0R4Kn 


H38g06 
6 










yes 


FL 


218 


OR2B8P 


H38g06 
7 






dJ313I6 .4 ;hs6Ml-29P 




yes 


FL 


219 


OR2Tn 


H38g06 
8 










yes 


FL 


220 


OR4Kn 


H38g06 
9 










yes 


FL 


221 


OR2A4 


H38g07 
0 






WUGSC:H_DJ0988G15 .2 


+ 


yes 


FL 


222 


OR7EnP 


H38g07 
1 












FL 


223 


OR4Kn 


H38g07 
2 










yes 


FL 


224 


OR13InP 


H38g07 
3 












FL 


225 


OR7EnP 


H38g07 
4 












FL 


226 


OR6Jn 


H38g07 
5 










yes 


FL 


227 


OR4Mn 


H38g07 
6 










yes 


FL 


228 


OR4VnP 


H38g07 
7 












FL 


229 


OR6Xn 


H38g07 
8 










yes 


FL 


230 


ORSIGn 


H38g07 
9 










yes 


FL 


231 


OR6EnP 


H38g08 
0 












FL 


232 


OR4NnP 


H38g08 

1 












FL 


233 


OR6MnP 


H38g08 
2 












FL 


234 


OR4Nn 


H38g08 
3 










ves 


FL 


235 


OR4Cn 


H38g08 
4 










yes 


FL 
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SEQ 
ID # 


Symbol 


HORDE 


Digi 


ncT 


Trivial 


Tran 


Int . 


E 


236 


OR4KnP 


H38g08 
5 












FL 


237 


ORnP 


H38g08 
6 














238 


OR5D3 


H38g08 
7 




Ob i y uo 


OPT 1 - fib* • OPT 1 - Re 








239 


OR2G1P 


H38g08 
8 


DS13 ;D 
S16 


OST619 


au y /4xxi . 4 ; nsoMx- zz> 


+ 




FT. 


240 


OR4Kn 


H38g08 
9 














241 


OR8BnP 


H38g09 
0 












FT 
r Xj 


242 


OR2B2 


H38g09 
1 






0R6-1 ;aJ193B12 . 4 




yes 


r Xj 


243 


OR7EnP 


H38g09 
2 












r X* 


244 


OR4KnP 


H38g09 
3 












FT 
r Xi 


245 


OR2AD1P 


H38g09 
4 






dJ2 5J6 . 1 ; hs6Ml-8P 






CT 

r Li 


246 


ORlAAnP 


H3 8g09 
5 












T?T 
r Xj 


247 


OR1E3P 


H38g09 
6 






OR17-210 






r Xi 


248 


OR8BnP 


H38g09 
7 












I* Xj 


249 


OR5Hn 


H38g09 
8 










yes 


FT. 


250 


OR1G1 


H38g09 
9 




OST909 


ORl 7-130; OR1 7 - 2 U y 


+ 


yes 


FT. 


251 


OR5HnP 


H38gl0 
0 












FL 


252 


ORnP 


H38gl0 
1 














253 


ORnP 


H3 8gl0 
2 














Z D H 


OP 4 Pr» P 
tr hit 


3 












FL 


255 


OR13Hn 


H3 8gl0 
4 










yes 


FL 
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SEQ 
ID # 


Symbol 


HORDE 


Digi 


OST 


Trivial 


x l. ctii 


Int . 


E 


256 ( 


DR7D1P I 


438gl0 
5 


( 


DST910 


CIT-B-440L2 ;OKiy- 
131/OR19-A 






FL 


257 < 


DR4KnP 1 


•138gl0 

5 












FL 


258 < 


DR7E24 


H38gl0 
7 




OST911 


CIT-B-440L2 ; ORiy- 0 


+ 




FL 


259 


0R51NnP 


H38gl0 
8 












FL 


260 


OR7E18P 


H38gl0 
9 




OST912 


OR19-14 ;TPCR26 


+ 






261 


OR7E19P 


H38gll 
0 




OST913 


HSCIT-B-440L2 ;OR19- 
7;TPCR110 


+ 




r J-i 


262 


OR7E41P 


H38gll 
1 




OST914 


ORll-20;hg84 






r Li 


263 


0R2R1 


H38gll 
2 




OST058 








TTT 


264 


ORlOACn 
P 


H38gll 
3 












X7T 

r JLi 


265 


ORSILn 


H38gll 
4 










yes 


t Li 


266 


0R52JnP 


H38gll 
5 












FT, 


267 


OR9LnP 


H38gll 
6 














268 


ORBlPnP 


H38gll 

7 












ITT 


269 


ORSHnP 


H38gll 
8 














270 


OR51An 


H38gll 
9 










yes 


FT 


271 


OR5HnP 


H38gl2 
0 












FT. 


272 


ORnP 


H38gl2 
1 














273 


OR52En 


H38gl2 
2 










yco 


FL 


274 


ORbrin 


3 










yes 


FL 


275 


OR4CnP 


H38gl2 
4 












FL 
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SEQ i 
ID # 


Symbol 


HORDE 


Digi 


Ob 1 


T3ri.vi.3l 


Tran 


Int. 


E 


276 C 


R52En V 

c 


I38gl2 










yes 


FL 


277 C 


)R10Dn I 
( 


I38gl2 










yes 


FL 


278 C 


)R5HnP I 


i38gl2 

7 












FL 


279 ( 


)R13An 1 


438gl2 
3 










yes 


FL 


280 < 


DR5HnP 


H38gl2 
9 












FL 


281 


0R5Kn 


H38gl3 
0 










yes 


FL 


282 


OR7EnP 


H38gl3 
1 












FL 


283 


OR4DnP 


H38gl3 
2 












FL 


284 


OR2ARnP 


H38gl3 
3 














285 


OR7E29P 


H38gl3 
4 




OST032 








FL 


286 


OR4CnP 


H38gl3 
5 












FL 


287 


OR5PnP 


H38gl3 
6 












FL 


288 


OR7EnP 


H38gl3 
7 












FL 


289 


OR56An 


H38gl3 
8 










yes 


FL 



290 


OR56AnP 


H38gl3 
9 














291 


OR5Pn 


H38gl4 
0 










yes 


FL 


292 


OR7E53I 


> H38gl4 
1 




OST915 


r\T> T 1 A O • CSX3 ~X - 1 4 ~k 






FL 


293 


OR5Pn 


H38gl4 
2 










yes 


FL 


I z y ** 




H38gl4 
3 










yes 


FL 


295 


OR5E1 


H38gl4 
4 






HSTPCR24 


+ 




FL 
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SEQ ! 
ID # 


Symbol 


HORDE 


Digi 


OST 


1 L X v J- a J- 


Tran 


Int. 


E 


296 C 


)R56AnP I- 

c 


[38gl4 














297 C 


)R4KnP I 
( 


J38gl4 














298 ( 


3R52Ln I 


i38gl4 

7 










yes 


FL 


299 < 


DR7EnP 1 


*38gl4 
B ] 














300 < 


DR52XnP 


H38gl4 
9 J 












FL 


301 


ORnP 


H38gl5 
0 I 














302 


OR56An 


H38gl5 

i ! 












FL 


303 


OR56AnP 


H38gl5 
2 














304 


OR1R1P 


H38gl5 
3 






OR17-1 






FL 


305 


OR52EnP 


H38gl5 
4 












FL 


306 


OR51AnP 


H38gl5 
5 












FL 


307 


OR51An 


H38gl5 
6 










ves 


FL 


308 


OR4CnP 


H38gl5 
7 












FL 


309 


OR52JnP 


H38gl5 
8 












FL 


310 


OR4RnP 


H38gl5 
9 














311 


OR52Jn 


H38gl6 
0 










2 


FL 


312 


OR4CnP 


H38gl6 
1 












FL 


313 


0R51AnI 


> H38gl6 
2 












FL 






H38gl6 
3 












FL 


315 


OR5MnP 


H38gl6 
4 












FL 
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SEQ 
ID # 


Symbol 


HORDE 


Digi 


act 


i riviai 


Tran 


Int . 


E 


316 


ORlOABn 
P 


H38gl6 
5 












FL 


317 


OR52SnP 


H38gl6 
6 












FL 


318 


OR5Mn 


H38gl6 
7 










ves 


FL 


319 


ORlOSn 


H38gl6 
8 










y Co 


FL 


320 


ORSMnP 


H3 8gl6 
9 












FL 


321 


ORlOGn 


H38gl7 
0 










yes 


c u 


322 


ORnP 


H38gl7 
1 












FT, 


323 


ORSMnP 


H38gl7 
2 












FT. 

XT Xj 


324 


ORlOGnP 


H38gl7 
3 














325 


ORlOTnP 


H38gl7 
4 












T7T 
r Ij 


326 


ORnP 


H38gl7 
5 














327 


ORlORnP 


H38gl7 
6 












FT, 


328 


OR5MnP 


H38gl7 
7 












FT. 


329 


OR7EnP 


H38gl7 
8 












FL 


330 


ORlOTn 


H38gl7 
9 










yes 


FL 


331 


OR1E1 


H38gl8 
0 


DS37/D 
S43 ;DS 
46 


OST916 


HGMP07I;OR17-2;OR17- 
32 


+ 


yes 


FL 


332 


OR5BKnP 


H38gl8 
1 














333 


OR5MnP 


H38gl8 
2 












FL 


5 J H 




3 




OST917 


OR17-137;OR17- 
16;OR17-201 


+ 


yes 


FL 


335 


ORlOADn 
P 


H38gl8 
4 


DS10 






+ 




FL 
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SEQ 
ID # 


Symbol 


HORDE 


Digi 


OST 


Trivial 


TiTcin 


Tnt- 
±11 u * 


E 


336 


ORlORn 


H3 8gl8 
5 








+ 


yes 


r J-i 


337 


OR5TnP 


H38gl8 
6 












r Li 


338 


OR4GnP 


H38gl8 
7 












r J-i 


339 


OR6Yn 


H38gl8 
8 










yes 


ITT 

r Li 


340 


OR1E2 


H38gl8 
9 




OST918 


OR17-135;OR17-93 


+ 


yes 


Fit 


341 


ORSHn 


H38gl9 
0 










yes 


FL 


342 


OR4Fn 


H38gl9 
1 










yes 


FL 


343 


ORlOKn 


H38gl9 
2 










yes 


FL 


344 


OR7LnP 


H38gl9 
3 














345 


OR8InP 


H38gl9 
4 












FL 


346 


ORlORnP 


H38gl9 
5 














347 


OR2AFnP 


H38gl9 
6 












FL 


348 


OR8Kn 


H38gl9 
7 










yes 


FL 


349 


ORnP 


H38gl9 
8 














350 


OR8KnP 


H38gl9 
9 












T-»T 

F Ii 


351 


OR51Hn 


H38g20 
0 










yes 


r Li 


352 


OR7EnP 


H38g20 
1 












r Li 


353 


ORnP 


H38g20 
2 














354 


ORSBMnP 


Hibg^u 
3 












FL 


355 


ORlOGnP 


H38g20 
4 
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SEQ 
ID # 


Symbol 


HORDc. 


Digi 




Trivial 


Tran 


Int. 


E 


356 


OR2Yn 


H38g20 
5 










yes 


FL 


357 


ORlODnP 


H38g20 
6 












FL 


358 


OR3BnP 


H38g20 
7 












FL 


359 


OR8Dn 


H38g20 
8 










V6S 


FL 


360 


ORSRnP 


H38g20 
9 














361 


ORlOGn 


H38g21 
0 












FL 


362 


OR5BDnP 


H38g21 
1 












FL 


363 


OR5ALnP 


H38g21 
2 












FL. 


364 


OR52HnP 


H38g21 
3 














365 


ORlOGn 


H38g21 
4 










ye s 


FT. 


366 


ORBMn 


H38g21 
5 










yes 


FT. 


367 


ORSIMn 


H38g21 
6 










yes 


FL 


368 


OR6Tn 


H38g21 
7 


DS15/D 
S146;D 
S147 






+ 


yes 


FL 


369 


OR6DnP 


H3 8g21 
8 












FL 


370 


OR4B1 


H38g21 
9 




OST208 






yes 


FL 


371 


OR5ALnP 


H38g22 
0 












FL 


372 


OR51Qn 


H38g22 
1 












FL 


373 


OR4Dn 


H38g22 
2 










ves 


FL 


7 n a 




H38a22 
3 










yes 


FL 


375 


OR4Xn 


H38g22 
4 










yes 


FL | 
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SEQ 
ID # 


Symbol 


HORDE 


Digi 


OST 


Trivial 


Tran 


Int . 


E 


376 


OR8Jn 


H38g22 
5 










yes 


FL 


377 


OR51JnP 


H38g22 
6 












FL 


378 


ORlOGn 


H38g22 
7 










yes 


FL 


379 


OR52En 


H38g22 
8 










yes 


FL 


380 


OR4Xn 


H38g22 
9 










yes 


FL 


381 


OR10A2 


H38g23 
0 


DS5;DS 
53;DS5 
6 


OST363 




+ 




FL 


382 


OR5Mn 


H38g23 
1 










yes 


FL 


383 


OR52En 


H38g23 
2 










yes 


FL 


384 


OR8Kn 


H38g23 
3 










yes 


FL 


385 


ORlOAn 


H38g23 
4 


DS55 






+ 


yes 


FL 


386 


OR8LnP 


H38g23 
5 












FL 


387 


ORSBPnP 


H38g23 
6 














388 


OR52Nn 


H38g23 
7 










yes 


FL 


389 


ORnP 


H38g23 
8 














390 


OR8JnP 


H38g23 
9 












FL 


391 


ORSMn 


H38g24 
0 










yes 


FL 


392 


OR52En 


H38g24 
1 










yes 


FL 


393 


OR5Tn 


H38g24 
2 










yes 


FL 


394 


OR52NnP 


H38g24 

3 












ITT 

r Li 


395 


OR4B2P 


H38g24 
4 




OST919 


hg44 9 






FL 
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SEQ 
ID # 


Symbol 


HORDE 


Digi 


OST 


Trivial 


Trari 
X X Oil 


Int . 


E 


396 < 


DR51KnP 


H38g24 
5 












FL 


397 


OR52QnP 


H38g24 
6 












PL 


398 


0R4Fn 


H38g24 
7 










y e& 


FL 


399 


ORllMnP 


H38g24 
8 














400 


OR52Nn 


H38g24 
9 










yes 


C Li 


401 


OR56An 


H38g25 
0 










yes 


T?T 

r 1j 


402 


OR5AWnP 


H38g25 
1 












r Li 


403 


OR52Nn 


H38g25 
2 










yes 


r Li 


404 


ORnP 


H38g25 
3 














405 


OR52EnP 


H38g25 
4 












FL 


406 


OR5BHnP 


H38g25 
5 












r Li 


407 


OR4QnP 


H38g25 
6 












r Li 


408 


OR5lEn 


H38g25 
7 










yes 


r Lt 


409 


ORllKnP 


H38g25 
8 












Jf Li 


410 


OR12D1P 


H3 8g2 5 
9 






AC004174- 

B;dJ994E9.7;hs6Ml-19 






I* Li 


411 


OR4NnP 


H38g26 
0 












TTT 
r Li 


412 


OR11A1 


H3 8g26 
1 






AC004174- 

A;dJ994E9 . 6 ; hs6Ml-18 


+ 


yes 


FT. 


413 


OR10C1 


H3 8g26 
2 






AC004174 ;dJ994E9 . 5 ;n 
S6M1-17 


+ 


yes 


r xj 


414 


OR^Hl 


3 


UO X J- *i 




OLFR42A-9004-14 ;OR6- 
2 ;dJ994E9.4;hs6Ml-16 


| + 


yes 


FL 


415 


OR9RnP 


H38g26 
4 












FL 
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SEQ 
ID # 


Symbol 


HORDE 


Digi 




I ilv lai 


Tran 


Int . 


E 


416 


OR4FnP 


H38g26 
5 














417 


OR7D4 


H38g26 
6 




r*\ rr\ r\ »"> f\ 

OST920 


UK l y - d ; ng j. u o 






FL 


418 


OR7E2 5P 


H38g26 
7 




0ST921 








FL 


419 


OR2D2 


H38g26 
8 






ORll-olO 




yes 


FL 


420 


OR 10 An 


H38g26 
9 












FL 


421 


OR2WnP 


H38g2 7 
0 








+ 






422 


OR7E16P 


H38g27 
1 




OST922 


CIT-B-440L2 ;ORl:?- 
133;ORl9-9 






FL 


423 


OR52Pn 


H38g27 
2 










yes 


FT. 


424 


OR6AnP 


H3 8g2 7 
3 












FT, 


425 


OR7D2 


H38g27 
4 


DS70;D 
S73 


OST923 


HTPCRH03 ;OR19-4 


+ 


yes 


FT. 


426 


OR52UnP 


H38g2 7 
5 












FL 


427 


OR2AGn 


H38g27 
6 










yes 


FL 


428 


OR7G3 


H38g27 
7 




OST085 






yes 


FL. 


429 


OR56BnP 


H38g27 
8 












FL 


430 


OR2AGnP 


H38g2 7 
9 












FL 


431 


OR56Bn 


H38g28 
0 










yea 


FL 


432 


OR6AnP 


H38g28 
1 












FL 


433 


OR4FnP 


H38g2 8 
2 












FL 




UKD nil 


H38a2 8 
3 










yes 


FL 


435 


OR4Mn 


H38g28 
4 










yes 


FL 
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ID # 


oyruDO j. 








Tri vis.1 




Int . 


E 


lie 

4 JO 


UK5Z I Ilr 


5 














43 / 


UK1 lrtnir 


TT "3 Q^-O O 

H3 BCJ^ b 

6 












VT. 


4 "3 Q 
4 J b 


UKyAn 


Ji3ogzb 
7 










yes 


FT 


439 


0R5MI1 


in o.— .o o 

H38g28 
8 










yes 


T?T 


440 


0R6Vn 


H3 8g2 8 
9 










yes 


T?T 


441 


0R4Nn 


H38g29 
0 








+ 


yes 


r Ij 


442 


ORSlAnP 


H38g29 
1 












FL 


443 


0R9PnP 


H38g29 
2 














444 


0R4H6P 


H3 8g29 
3 






0R15-71 ;OR15-82 






FL 


445 


ORSlFnP 


H3 8g2 9 
4 












FL 


446 


0R7E1P 


H38g29 
5 






AC004923 






FL 


447 


0R51Tn 


H38g29 
6 










yes 


FL 


448 


0R2Vn 


H38g29 
7 










yes 


F L 


449 


0R51HnP 


H38g29 
8 












FL 


450 


OR 51 An 


H38g29 
9 










yes 


FL 


451 


0R2AInP 


H38g30 
0 












FL 


452 


0R2F2 


H38g30 
1 






0R7- 

t . TaTt ic cp .u ninccom o 
±;WUvjbv_:n iJUUbbyoJ-U . 

1 




yes 


FL 


A C 1 


UKlr ±Z 


TT O Q _ "3 pi 

H3bg3U 
2 






Qu Jli io . d ; nsbnl -JDr 




yes 


r J-i 


4 o4 


\JJ\ 1 ul rr 


no oy ju 

3 






0R19- 15 




yes 


FL 


455 


OR7G2 


H38g30 
4 




OST260 






yes 


FL 
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SEQ 
ID # 


Symbol 


unp nF 




OST 


Trivial 


Trail 


Int. 


E 


456 


OR1M1 


5 






OR19-6 




ves 


FL 


457 


OR51UnP 


6 














458 


OR52Hn 


H38g30 

7 










yes 


FL 


459 


OR1F1 


H38g30 
8 




OST925 


OLFMF ; OR16 - 3 6 ; OR1 6 - 

^ 7 « HT? 1 £ - ft ft ♦ HP 16- 

89/OR16-90 


+ 


yes 


FL 


460 


ORlOPnP 


H3 8g3 0 
9 














461 


OR4FnP 


H38g31 
0 












FT. 


462 


OR2T1 


H38g31 

1 






UK J. - 




yes 


FL 


463 


OR7EnP 


H38g31 
2 












FL 


464 


OR51Gn 


H38g31 
3 










yes 


FT. 


465 


OR2Tn 


H3 8g31 
4 










yes 


FT. 


466 


OR5BGnP 


H38g31 
5 














467 


OR5WnP 


H38g31 
6 












FT. 


468 


OR51Sn 


H38g31 
7 










yes 


FL 


469 


OR5WnP 


H38g31 
8 














470 


OR51AnP 


H3 8g31 
9 












FTj 


471 


OR5Dn 


H38g32 
0 










yes 


FL 


472 


OR7EnP 


H38g32 
1 












FL 


473 


ORblFn 


H3 8gj^ 

2 










ves 


FL 


474 


OR5Dn 


H38g32 
3 










yes 


FL 


475 


OR52Rn 


H38g32 
4 










yes 


FL 
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SEQ 
ID # 


Symbol 


HURJJc* 


Digi 


OCT 


1I1V a. a -L 


Tran 


Int . 


E 


476 


ORnP 


H38g32 
5 












FL 


477 


OR7EnP 


H38g32 
6 












FL 


478 


OR6Qn 


H38g32 
7 










VPS 


FL 


479 


0R4Fn 


H38g32 
8 










yes 


FT, 


480 


OR7EnP 


H38g32 
9 














481 


OR 7 En 


H38g33 
0 










yes 




482 


OR4Nn 


H38g33 
1 










yes 


r L* 


483 


OR2ASnP 


H38g33 
2 














484 


ORllHn 


H38g33 
3 










yes 


fit 

r L» 


485 


OR2Tn 


H38g33 
4 










yes 


r Li 


486 


OR2TnP 


H38g33 
5 














487 


OR2AKnP 


H38g33 
6 












r Li 


488 


ORnP 


H38g33 
7 














489 


ORSDnP . 


H38g33 
8 












ITT 
r Li 


490 


OR7EnP 


H38g33 
9 














491 


OR5L2 


H38g34 
0 






tlPUTnPDVI c 

HSril fLKAlb 


+ 


yes 


r j_i 


492 


ORSDn 


H38g34 
1 










yes 


r Li 


493 


ORnP 


H38g34 
2 


















H38g34 
3 










yes 


FL 


495 


OR9MnP 


H38g34 
4 
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SEQ 
ID # 


Symbol 


HORDE 


Digi 


OST 


Trivial 


Tit sin 


Int . 


E 


496 


DR7E62P 


H38g34 
5 




OST926 


OR2 -4 ; OKZ - , uk^ 
53;OR2-75 






FL 


497 


0R9LnP 


H38g34 
6 












FL 


498 


0R7E46P 


H38g34 
7 




OST37 9 








FL 


499 


OR1S1 


H38g34 
8 




OST034 






yes 


FL 


500 


OR5DnP 


H38g34 
9 














501 


OR9InP 


H38g35 
0 














502 


ORSDn 


H38g35 
1 










yes 


FT. 


503 


OR9QnP 


H38g35 
2 












r Lj 


504 


ORSlCnP 


H38g35 
3 














505 


OR5WnP 


H38g35 
4 














506 


OR9InP 


H38g35 
5 












FT 


507 


OR51AnP 


H38g35 
6 












r xj 


508 


OR5L1- 


H38g35 
7 




OST262 






yes 


FT 


509 


0R7EnP 


H38g35 
8 








+ 






510 


OR5BLnP 


H38g35 
9 














511 


OR51En 


H38g36 
0 










yes 


FL 


512 


OR51Dn 


H3 8g36 
1 










yes 


FT. 


513 


OR52In 


H38g36 
2 










yes 


FL 


C 1 A 

514 




n J oy jo 
3 


DS67 






+ 




FL 


515 


OR52In 


H38g36 
4 










yes 


FL 
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SEQ 
ID # 


Symbol 


HORDE 


Digi 


Uo 1 


1 i XV la ± 


TiTcLn 


Int . 




516 


OR4KnP 


H38g36 
5 












FL 


517 


OR52MnP 


H38g36 
6 












r u 


518 


ORnP 


H38g36 
7 














519 


ORnP 


H38g36 
8 














520 


ORnP 


H38g36 
9 












r 1j 


521 


ORnP 


H38g37 
0 














522 


ORnP 


H38g37 
1 














523 


ORnP 


H38g37 

2 • 














524 


ORnP 


H38g37 
3 














525 


ORnP 


H38g37 
4 














526 


OR6Pn 


H38g37 
5 










yes 


r L) 


527 


OR7EnP 


H38g37 
6 












r J-i 


528 


ORnP 


H38g37 
7 














529 


OR7EnP 


H38g37 
8 












t?T 


530 


ORnP 


H38g37 
9 














531 


ORlOXnP 


H38g38 
0 












r u 


532 


ORlOZn 


H38g38 
1 










yes 




533 


OR6KnP 


H38g38 
2 












FT. 


CIA 


UKO JSJ.1 


iij oy o o 
3 










yes 


FL 


535 


ORlFnP 


H38g38 
4 
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SEQ 
ID # 


Symbol 


HORDE 


Digi 


OCT 


Tri visl 


Tran 


Int. 


E 


536 


ORlABnP 


H38g38 
5 














537 


OR52MnP 


H38g38 
6 












FL 


538 


ORlXnP 


H38g38 
7 












PL 


539 


OR4FnP 


H38g38 
8 














540 


OR52MnP 


H38g3 8 
9 












FL 


541 


OR2Vn 


H38g39 
0 










yet? 


FL 


542 


OR2V1P 


H38g39 
1 




OST265 








FL 


543 


OR2Zn 


H38g39 
2 










yes 


FL 


544 


OR52KnP 


H38g39 
3 








+ 






545 


ORlOHn 


H38g39 
4 










yes 


FT. 


546 


OR2Dn 


H38g39 
5 










yes 


FL 


547 


OR7EnP 


H3 8g39 
6 














548 


ORllGnP 


H3 8g39 
7 












FL 


549 


ORnP 


H3 8g3 9 
8 














550 


ORllGn 


H3 8g3 9 
9 










yes 


FL 


551 


ORllHnP 


H38g40 
0 












FL 


552 


OR6Kn 


H3 8g4 0 
1 












FL 


553 


ORllHn 


H3 8g40 
2 










yes 


FL 


Dj*i 




H38g4 0 
3 














555 


ORllHnP 


H3 8g4 0 
4 












FL 
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SEQ 
ID # 


Symbol 


HORDE 


Digi 


OST 


Trivial 


T2ra.il 


JL11L- . 


F 


556 


OR6KnP 


H38g40 
5 












PL 


557 


OR6Kn 


H38g40 
6 










yes 


FT. 


558 


OR2Ln 


H38g40 
7 










yes 


FT. 


559 


OR4GnP 


H38g40 
8 














560 


OR6Nn 


H38g40 
9 










yes 




561 


OR2LnP 


H38g41 
0 















562 


OR9A1 


H38g41 
1 






HSHTPCRX06 








563 


OR6Nn 


H38g41 
2 










yes 


FL 


564 


ORlOHn 


H38g41 
3 










yes 


FL 


565 


OR7EnP 


H3 8g41 
4 












FL 


566 


OR2AQnP 


H38g41 
5 














567 


OR2LnP 


H38g41 
6 












FL 


568 


OR5ARn 


H38g41 
7 










yes 


FL 


569 


OR7EnP 


H38g41 
8 












FL 


570 


ORlOAAn 
P 


H3 8g41 
9 














571 


ORlOJnP 


H38g42 
0 












FL 


572 


OR5A1P 


H38g42 
1 


DS69;D 
S71;DS 
128 ;DS 
129 


OST181 




+ 


yes 


FL 


573 


OR2AHnP 


H3 8g42 
2 












ITT 
r Li 


574 


ORlOJnP 


H38g42 
3 












FL 


575 


OR56BnP 


H38g42 












FL 
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SEQ 
ID If 


Symbol 


HORDE 


Digi 


OST 


Trivial 


Tran 


Int. 


E 






4 














576 


OR5M1 


H38g42 
5 




Ob 1 UbU 






ves 


FL 


577 


OR52WnP 


H38g42 
6 














578 


OR5AMnP 


H38g42 
7 












FL 


579 


OR52BnP 


H38g42 
8 












FL 


580 


ORSMnP 


H38g42 
9 












FL 


581 


ORSAPnP 


H38g43 
0 












FL 


582 


OR56Bn 


H38g43 
1 










yes 


FL 


583 


OR5APn 


H38g43 
2 










yco 


FL 


584 


OR52Bn 


H38g4 3 
3 










yes? 


FL 


585 


OR9Gn 


H38g43 
4 










yes 


FL. 


586 


OR52Kn 


H38g43 
5 










yes 


FL 


587 


OR5MnP 


H38g43 
6 












FL 


588 


OR52Kn 


H38g4 3 
7 










yes 


FL 


589 


OR52KnP 


H38g43 
8 








+ 




FL 


590 


OR52BnP 


H38g43 
9 












FL 


591 


OR2B6P 


H38g44 
0 






OR6 - J 1 






FL 


592 


OR2WnP 


H38g44 

1 












FL 


593 


OR2AnP 


H38g44 

2 












FL 






H38g44 

3 














595 


OR2LnP 


H38g44 
4 
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SEQ 
ID # 


Symbol 


HORDE 


Digi 


OCT 


Trivial 


Tran 


Int . 


E 


596 ( 


DR2W2P 


K38g44 
5 


DS148 




H.T^i ^Tfi 9«hs6Ml-30P 


+ 




FL 


597 


DR2LnP 


H38g44 
6 














598 


0R2B7P 


H38g44 
7 






H.Tll^TA ^'hs6Ml-31P 






FL 


599 


OR2Ln 


H38g44 
8 












FL 


600 


ORSBFn 


H38g44 
9 












FL 


601 


OR2LnP 


H38g45 
0 












FT, 


602 


OR7EnP 


H38g45 
1 














603 


OR1H1 


H38g45 
2 


DS122 


0ST26 




+ 




FT, 


604 


ORnP 


H38g4 5 
3 














605 


OR4Dn 


H38g4 5 
4 










yes 


FT. 


606 


ORILn 


H38g45 
5 










yes 


FTj 


607 


OR5AXn 


H38g45 
6 










yes 


FT, 


608 


OR5An 


H38g45 
7 










yes 


FT., 


609 


OR5AYn 


H38g45 
8 










yes 


FL 


610 


OR13Gn 


H38g45 
9 










\rc* c 
y CJD 


FL 


611 


OR5BBnP 


H38g46 
0 














612 


OR9GnP 


H38g46 
1 












FL 


613 


OR2TnP 


H38g46 

2 












FL 


Di4 


\JX\J.Xtr 


3 












FL 


615 


ORlJn 


H38g46 
4 








+ 


yes 


FL 
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SEQ 
ID # 


Symbol 


HORDE 


Digi 




Trivial 


Tran 


Int. 


E 


616 ( 


3R2CnP 


H38g46 
5 












FL 


617 


DR9GnP 


H38g46 
6 












FL 


618 


0R2C1 


H38g46 
7 






OLFmf 3 


+ 


yes 


FL 


619 


OR51AnP 


H38g46 
8 














620 


OR9Gn 


H38g46 
9 










yes 


FL 


621 


OR52Bn 


H38g47 
0 










yes 


FL 


622 


OR1K1 


H38g47 
1 






ngyy 




yes 


FL 


623 


ORBlRnP 


H38g47 
2 












FL 


624 


OR7EnP 


H38g47 
3 












FL 


625 


OR52PnP 


H38g47 
4 












FL 


626 


OR7EnP 


H38g4 7 
5 












FL 


627 


OR7EnP 


H38g47 
6 














628 


OR4KnP 


H38g47 
7 


DS66 




AT) O 1 - 1 


+ 




FL 


629 


OR4KnP 


H38g47 
8 












FL 


630 


OR7EnP 


H38g47 
9 














631 


OR51In 


H38g48 
0 










yes 


FL 


632 


ORSlIn 


H38g48 
1 










yes 


FL 


633 


OR2AnP 


H38g48 
2 














634 


OR2A2 


H38g48 
3 




OST008 








FL 


635 


OR2AnP 


H38g4 8 
4 












FL 
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SEQ 
ID # 


Symbol 


HORDE 


Digi 




Trivial 


Tran 


Int. 


E 


636 ( 


}R2Gn ] 


K38g4 8 
5 










yes 


FL 


637 ( 


3R2AnP 


H38g4 8 
6 














638 


DR6Fn 


H38g4 8 
7 


DS20;D 
S21;DS 
23;DS2 
7;DS28 
;DS39; 
DS40;D 
S113/D 
S126;D 
S135;D 
S137/D 
S138;D 
S139;D 
S140/D 
S141;D 
S145 






+ 


yes 


FL 


639 


OR2AnP 


H38g48 
8 














640 


OR2Gn 


H38g48 
9 










yes 


FL 


641 


OR7E37P 


H38g49 
0 








+ 




FL 


642 


OR5AVn 


H38g49 
1 


DS4 ;DS 
6 ;DS11 








ves 


FL 


643 


OR2AJnP 


H38g49 
2 












FL 


644 


OR13EnP 


H38g49 
3 












FL 


645 


OR2Cn 


H38g49 
4 










yes 


FL 


646 


OR2TnP 


H38g49 
5 















647 


OR2WnP 


H38g4 9 
6 














648 


OR13Jn 


H38g49 
7 










yes 


FL 


649 


0R6RnP 


H38g49 
8 












FL 


650 


ORSATn 


H3 8g4 9 
9 










yes 


FL 
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SEQ £ 
ID # 


Jymbol 


HORDE 


Digi u^i 


Trivial 


rran 


Int. 


E 


6 51 0 


R2Zn H 
0 


38g50 








yes 


FL 


652 C 


R4Ln E 
|l 


I38g50 








yes 


FL 


653 C 


)R4UnP \l 


I38g50 

I 










FL 


654 C 


)R4Fn I 


138g50 
3 








yes 


FL 


655 < 


3R4FnP 1 

1 


*38g50 
4 










FL 


656 < 


DR4Fn 


H38g50 
5 








yes 


FL 


657 


0R4Fn 


H38g50 
6 








yes 


FL 


658 


OR4AnP 


H38g50 
7 










FL 


659 


0R4LnP 


H38g50 
8 










FL 


660 


OR7E33P 


H38g50 
9 


OST927 


UnC Q O 

ngo o o 






FL 


661 


OR2Cn 


H38g51 
1° 








yes 


FL 


662 


OR4Kn 


H38g51 
|l 








yes 


FL 


663 


OR5U1 


H38g51 
|2 




HAi 5 0 A6 4 ; hs 6M1 - 2 8 




yes 


FL 


664 


pR4Kn 


H38g51 
3 








yes 


' FL 


665 


OR5V1 


H38g51 
|4 




biLi ^OAS 2;hs6Ml-21 




yes 


FL 


666 


OR4QnP 


H38g51 
[5 










FL 


667 


OR12D3 


H38g51 
6 




bA150A6 . 1 ;hs6Ml-27 




yes 


FL 


668 


OR4Kn 


H38g51 
I 7 








yes 


FL 


669 


ORSlCn) 


? H38g51 
I 8 














670 


OR1J2 


H38g51 
I 9 




OST044 


hgl52 




yes 


FL 
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SEQ I 
ID # 


Symbol 


HORDE 


Digi 


/-> p rp 


Trivial 


Tran 


Int. 


E 


671 C 


R5BJnP \V 

|c 


I38g52 
) 














672 C 


>R1J1 * 
[] 


I38g52 I 

I 


5S130 C 






+ 


yes 


FL 


673 C 


)R13En I 


138g52 
l_ 










put 




674 ( 


}R4KnP N 


138g52 1 
3 


0S1 






+ 




FL 


675 < 


DRILnP 1 


H38g52 
4 














676 


OR2CnP 


H38g52 
5 














677 


OR4TnP 


H38g52 
6 












FL 


678 


OR5BnP 


H38g52 
7 














679 


OR4Kn 


H38g52 
8 










yes 


FL ! 


680 


ORllLn 


H38g52 
9 










yes 


FL 


681 


OR7E68P 


H38g53 
0 




OST929 


OR912-108;OR912- 

i r\ q . noQi T.n n ■ OR912 - 

46;hg523;hg674 






FL 


682 


OR7EnP 


H38g53 
|l 












FL 


683 


OR7E31P 


H38g53 
2 




OST016 ;0 
|ST205 








FL 


684 


OR7EnP 


H38g53 
I 3 












FL 


685 


OR5AKnP 


H38g53 
I 4 












FL 


686 


ORSAKn 


H38g53 
5 










yes 


FL 


687 


OR5AKn 


H38g53 
I 6 










yes 


FL 


688 


OR5BQn£ 


> H3 8g53 
I 7 














689 


ORINn 


H38g53 
I 8 


DS136; 
DS142 






+ 


yes 


FL 


690 


0RU4 


H38g53 
|9 




OST930 


HSHTPCRX01 


+ 


yes 


FL 
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SEQ 
ID # 


Symbol 


v«An ^^^^ 

HORDE 


Digi 


Uo 1 


Trivial 


Tran 


Int. 


E 


691 ( 


DRINn I 


438g54 

D 










yes 


FL 


692 1 


DR2AnP ] 


H38g54 
1 












FL 


693 


0R2ANnP 


H38g54 
2 














694 


OR5K1 


H38g54 
3 






W^HTPCPXIO 
non i r^AAJi w 


+ 


yes 


FL 


695 


OR2K2 


H38g54 
4 






UQWTPrPHflfi 
Hon 1 lr\~.ns\\J vJ 




yes 


FL 


696 


OR8Hn 


H38g54 
5 










yes 


FL 


697 


ORnP 


H3 8g54 
6 














698 


OR4AnP 


H38g54 
7 














699 


OR4An 


H38g54 
8 










yes 


FL 


700 


OR6Sn 


H38g54 
9 










yes 


FL 


701 


OR4RnP 


H38g55 
0 














702 


OR13Cn 


H38g55 
1 










yes 


FL 


703 


OR13DnP 


H38g55 
2 












FL 


704 


OR7EnP 


H38g55 
3 












FL 


705 


ORlOPnP 


H38g55 
4 












FL 


706 


OR8In 


H38g55 
5 










yes 


FL 


707 


OR8G1 


H38g55 
6 








+ 


put 




708 


ORnP 


H38g55 
7 














709 


OR5F1 


H3Bg55 
8 






OR11-10 




yes 


FL 


710 


ORSFnP 


H38g55 
9 












FL 
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SEQ I 
ID # 


Symbol 


HORDE 


Digi 




Trivial 


Tran 


Int. 


E 


711 C 


R6BnP H 

C 


I38g56 












FL 


712 C 


JR2D1 * 


I38g56 

L 




K 


ig27 




put 




713 C 


)R5ASn I 


I38g56 

I 










yes 


FL 


714 ( 


}R5SnP 1 


*38g56 
3 












FL 


715 < 


DR5AQnP ) 


H38g56 
4 














716 


OR6BnP 


H38g56 
5 












FL 


717 


OR5JnP 


H38g56 
6 












FL 


718 


OR9AnP 


H38g56 
7 












FL 


719 


ORSBEnP 


H38g56 
8 












FL 


720 


OR9An 


H38g56 
9 










yes 


FL 


721 


OR8Hn 


H38g57 
0 










yes 


FL 


722 


ORSBNnP 


H38g57 
1 














723 


OR8Jn 


H38g57 
2 










yes 


FL 


724 


OR9NnP 


H38g57 

3 














725 


OR7EnP 


H38g57 
4 












FL 


726 


OR7E9P 


H38g57 
5 












FL 


727 


OR8KnP 


H38g57 
6 














728 


OR2AnP 


H38g57 
7 














729 


OR8Kn 


H38g57 
8 










yes 


FL 


730 


OR7E391 


P H38g57 
9 




OST931 


hg611 
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SEQ 
ID # 


Symbol 


HORDE 


Digi 




Trivial 


Tran 


Int. 


E 


731 ( 


DR7E27P 1 


ci38g58 
0 




ijo l y -3 6 


hg616 








732 ( 


2R2Hn 


H38g58 
1 










put 




733 


0R13CnP 


H38g58 
2 






i 






FL 


734 


OR13Cn 


H38g58 
3 










yes 


FL 


735 


OR2S1P 


H38g58 
4 




OST611 








FL 


736 


OR2AMnP 


H38g58 
5 














737 


OR1N1 


H38g58 
6 




OST933 


OKI "ZD 




DUt 




738 


OR2S2 


H38g58 
7 




OST715 






ves 


FL 


739 


OR7E26P 


H38g58 
8 






OR1- bl ; UK1- /*5 i UK1 
73/OR912-95 








740 


OR1F11 


H38g58 
9 






hg91 




DUt 




741 


ORSACnP 


H38g59 
0 












FL 


742 


OR5B10P 


H38g59 
1 






OR13 - 34 ; UK J. 0 - 
64;OR13-67 








743 


OR2AnP 


H38g59 
2 












FL 


744 


OR1E5 


H38g59 
3 


DS117 ; 
DS143 




ORlJ- 0 b 


+ 


DUt 




745 


OR4 Fn 


H38g59 
4 










ves 


FL 


746 


OR5CnP 


H38g59 
5 














747 


OR2WnP 


H38g59 
6 














748 


OR2L2 


H38g59 
7 








+ 


put 




749 


OR4H8P 


H38g59 
8 






OR14-58 








750 


OR5D10E 


H38g59 
9 






OR912-94 
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SEQ 
ID # 


Symbol 


HORDE 


Digi 


APT 


i llVlal 


Tran 


Int . 


E 


751 


OR7A12P 


H38g60 
0 






UK14 - J-l , UK-L^ D -J 








752 


OR2L1 


H38g60 
1 






Hon 1 FLKAUz 




put 




753 


OR2F3P 


H38g60 
2 






UK14 -bU 




put 




754 


OR4H10P 


H38g60 
3 




OST934 


OR15-6 9 ;ORlb- 
80/OR15-81 








755 


OR5H1 


H38g60 
4 






HSHTPCRX14 


+ 


put 





756 


OR2K1 


H38g60 
5 






HSHTPCRX17 


+ 


put 




757 


OR7E11P 


H38g60 
6 






0R11-2 








758 


OR7A3P 


H38g60 
7 




OST93 5 


0Rll-7b 








759 


OR6A1 


H38g60 
8 






OR11-55 


+ 


yes 


r JLj 


760 


OR5I1 


H38g60 
9 






0LF1 


+ 


yes 


FL 


761 


OR2H3 


H38g61 
0 






HUMORLMHC 


+ 


yes 


r J-i 


762 


OR10J1 


H38g61 
1 


DS3;DS 
14 




HSHGMP07 J 


+ 


yes 


r J-i 


763 


OR7E3P 


H38g61 

2 






0R11-9 








764 


OR1D6P 


H38g61 
3 






OR11-13 ;0Rll-22 








765 


OR5D10P 


H38g61 
4 






0R18- 17 /0R18 - 
42;OR18-43;OR18-44 








766 


OR5D5P 


H38g61 
5 






OR18- 79 ;OR912-47 








767 


OR52A1 


H38g61 
6 






HPFHIOK 


+ 


yes 


FT. 


768 


OR2AEn 


H38g61 
7 












r u 


/ d y 




njoyoi 

8 












FL 


770 


OR6LnP 


H38g61 
9 












FL 
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SEQ 
ID # 


Symbol 


HORDE 


Digi 


OST 


iriviai 


Tran 


Int . 


E 


771 ( 


}R7MnP 1 


138g62 
0 














772 ( 


DR13Cn 


H38g62 
1 










ves 


FL 


773 ( 


DR13Cn 


H38g62 
2 










ves 


FL 


774 


0R2InP 


H38g62 
3 








+ 






775 


OR4An 


H38g62 
4 










yes 


FL 


lie 


OR2InP 


H38g62 
5 








+ 






111 


OR4AnP 


H38g62 
6 












FT. 


778 


OR4AnP 


H38g62 
7 












FT. 


IIS 


OR8C1P 


H38g62 
8 






OR11-175 








780 


OR4AnP 


H38g62 
9 












FT. 


781 


OR7E15P 


H38g63 
0 






OR11-392 








782 


OR10A1 


H38g63 
2 






OR11-403 




put 





783 


OR2An 


H38g63 
3 








+ 


put 




784 


OR7EnP 


H38g63 
4 








+ 




FL 


785 


OR7En 


H38g63 
5 








+ 


put 




786 


OR51A1P 


H38g63 
6 






HPFH60R 


+ 




FL 


787 


OR7E47P 


H38g63 
7 






HSORBPL41 ;bpl41-16 


+ 




FL 


788 


OR5B5P 


H38g63 
8 






OR3-144 ;OR912- VI 








789 


ad 1 n n 


njoyo j 

9 






OR3-145 




put 




790 


OR8G2 


H38g64 
0 






HSTPCR120 


+ 


put 





85 



WO 01/27158 



PCI7US00/27582 



SEQ 
ID # 


Symbol 


HORDE 


Digi 


APT 


Trivia 1 


Tran 


Int . 


E 


791 


ORlSn 


H38g64 
1 










ves 
j 


FL 


792 


OR4AnP 


H38g64 
2 












FL 


793 


OR4AnP 


H38g64 
3 












FL 


794 


OR4AnP 


H38g64 
4 












FL 


795 


OR4AnP 


H38g64 
5 












FL 


796 


OR4AnP 


H38g64 
6 












FL 


797 


OR4AnP 


H38g64 
7 












FL 


798 


OR4An 


H38g64 
8 










yes 


FL 


799 


OR4An 


H38g64 
9 










yes 


FL 


800 


OR7E42P 


H38g65 
0 




OST001 










801 


0R2M3P 


H38g65 
1 




OST003 










802 


OR4H11P 


H38g65 
2 






OR4-114 ;OR4 -lib ;UK4- 
119 








803 


OR7E57P 


H38g65 
3 




OST007 










804 


OR2B1P 


H38g65 
4 






OR5 -40; OR5 - 4 1 








805 


OR7E34P 


H38g65 
5 




OST011 










806 


OR7E56P 


H38g65 
6 




OST013 










807 


OR3AnP 


H38g65 
7 














808 


OR4H5P 


H38g6 5 
8 






OR5-39;OR5-84 








809 


ORlEn 


H38g65 
9 


DS47;D 
S115;D 
S120;D 
S121;D 
S123;D 






+ 


put 
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SEQ 
ID # 


Symbol 


HORDE 


Digi 


OST 


Trivial 


Trail 


Int. 


E 








3125 












810 ( 


}R51CnP ] 


*38g66 
0 














811 ( 


DR2WnP 


H38g66 
1 












FL 


812 


DR51B1P 


H38g66 
2 






AF149710 






FL 


813 


OR7E81P 


H38g66 
3 




OST021 










814 


OR7E44P 


H38g66 
4 




OST022 










815 


OR5B7P 


H38g66 
5 






OR6-55;OR6-57 








816 


OR7E36P 


H38g66 
6 




OST024 










817 


OR2A5 


H38g66 
7 






OR7-138;OR7-141 




put 




818 


OR5B1P 


H38g66 
8 




OST936 


OR8-122/OR6-123 








819 


OR8B8 


H38g66 
9 






HSTPCR8 5 


+ 


yes 


TTT 


820 


OR8B4P 


H38g67 
0 






AC002556-D 




yes 


r l-i 


821 


ORnP 


H38g67 
1 












r Xj 1 


822 


OR8B3 


H38g67 
2 






AC002556-B 




yes 


r Lt 


823 


OR2Bn 


H38g67 
3 










yes 


"FT. 


824 


OR8B6P 


H38g67 
4 






AC002556-G 






X" Xj 


825 


OR8B5P 


H38g67 
5 






AC002556-A 






VT. 


826 


OR4E2 


H38g67 
6 






AE000658-A 




yes 


FL 


827 


OR8B7P 


H38g67 
7 






AC002556-F 






FL 


828 


ORllJnJ 


' Hjoyo / 
8 












FL 


829 


OR4E1P 


H38g67 
9 






AE000658 






FL 
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SEQ 
ID # 


Symbol 


HORDE 


Digi 


OST 


Trivial 


Tran 


int. . 


Ct 


830 C 


)R10DnP I 
( 


J38g68 

3 














831 ( 


DRnP 1 


-I38g6 8 
L 














832 ( 


DR8D2 ] 


S38g68 
2 






AC002556-E 




yes 


FT. 


833 


ORllInP 


H3 8g68 
3 












CT 

r u 


834 


ORHJnP 


H38g68 
4 












FL 


835 


ORlOAnP 


H38g68 
5 


DS12;D 
S65 






+ 




FL 


836 


OR8C3P 


H3 8g68 
6 






OR912-106;OR912- 
45;pDJ9jl4 






FL 


837 


OR2DnP 


H38g68 
7 












FL 


838 


OR4PnP 


H38g68 
8 














839 


OR7E21P 


H38g68 
9 




OST035 


0R4DG 








840 


OR2M1 


H38g69 
0 




OST037 






put 




841 


OR7AnP 


H38g69 
1 














842 


OR5D11P 


H38g69 
2 






OR8-125;OR8-127 








843 


OR7E50P 


H38g69 
3 






OR8-126 








844 


OR7E45P 


H38g69 
4 




OST04 9 










845 


OR7E77P 


H38g69 
5 




OST060 










846 


OR8B2 


H3 8g6 9 
6 






AC002556-C 




yes 


r Xj 


847 


OR8D1 


H38g69 
7 




OST004 


pDJ9jl4 




yes 


r Li 


848 


OR8B1P 


H38g69 
8 




uo i y j / 


OR11- 561 






FL 


849 


OR7A1P 


H38g69 
9 




OST938 


OLF4p;OR19-3;hg513 






FL 
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ID # 


oyillLAJX 






OST 


Trivial 


Tran 


Int. 


£ 


OCA 


UK / CMOf 


0 






ORll-lla;pDJ3 92al7 






FL 


QC1 

COX 


HD A TVn D 


1 












FL 


QC'l 


UK / CtO Uir 


2 




OST939 


pDJ392al7 






FL 


O C "3 

853 


UK4Unir , 


U"l Qrr*7 fi 

nJoy ' U 
3 












FL 


854 


OK7E1UP 


Hi oy / U 
4 






AC000385-A 






FL 


855 


OR10B1P 


HJog / u 
5 






AP003 956 -A-OR19-19 






FL 


856 


0R2InP 


HJ8g/U 
6 














857 


0R4Dn 


H38g70 
7 










ves 


FL 


858 


0R5ACn 


H38g70 
8 










put 




859 


OR2I1 


H38g70 
9 






AC004179- 

A • riiT? 7 1 M2 1 7*hs6Ml- 
14 


+ 






860 


ORlOHl 


H38g71 
0 










ves 


FL 


861 


OR7E59P 


H38g71 
1 




UO 111? 










862 


OR7E2 8P 


H38g71 
2 














863 


OR5B3 


H3 8g71 
3 




vJo 1 1Z3 






DUt 




864 


OR2A6 


Hi sg / i 

4 










put 




o c c 

865 


UKoun 


nj og / 1 
5 










put 




ace 
ODD 


UK /c*b4P 


nj by / X 
6 














oo / 


UK / r*<± o ir 


7 




OST193 










868 


OR6 7AnP 


H38g71 
8 












FL 


869 


0R4DnP 


H38g71 
9 












FL 
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SEQ 
ID # 


Symbol 


HORDE 


Digi 


OST 


i 11 Via J. 


Trsm 


Int . 


E 


870 


0R4CnP 


H38g72 
0 












FL 


871 


0R4DnP 


H38g72 
1 












FL 


872 


OR10H2 


H38g72 
2 






ACUU4 by/ -A 




ves 

2 


FL 


873 


OR10H3 


H38g72 
3 






AC004597-B 


+ 




FL 


874 


OR55CnP 


H38g72 
4 














875 


OR55BnP 


H38g72 
5 














876 I 


OR52VnP 


H38g72 
6 












"FT 


877 


OR2B3 


H38g72 
7 






0R6- 

4;dJ80I19.1;hs6Ml-l 




yes 


r Lj 


878 


OR52TnP 


H38g72 
8 














879 


OR2J1P 


H38g72 
9 






OR6- 

5;dJ80I19. 2 ;hs6Ml-4 








880 


OR52HnP 


H38g73 
0 












r ju 


881 


OR2J3 


H38g73 
1 






OR6- 

6;dJ80I19. 7;hs6Ml-3 




yes 


r JLi 


882 


OR52An 


H38g73 
2 








+ 


put 




883 


OR4Qn 


H38g73 
3 










put 




884 


OR52BnP 


H38g73 
4 












r Li 


885 


OR2N1P 


H38g73 
5 


DS9 




OR6- 

7;dJ80I19. 3;hs6Ml-2 


+ 




FT. 


886 


ORSlEnP 


H38g73 
6 








+ 






887 


OR2J2 


H38g73 
7 






OR6 - 

8;dJ80I19.4;hs6Ml-6 






FL 


Q Q Q 

boo 


UKzin 


8 








+ 


put 




889 


OR2J4P 


H38g73 
9 






OR6- 

9;dJ80I19. 5;hs6Ml-5 






FL 
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SEQ 
ID # 


Symbol 


ri\Jt\±JS~j 


Hi cri 


OST 


Trivial 


Tran 


Int. 


E 


890 


f~\Ti TO A A 

0R7fcj4UF 


0 




0ST2 15 










891 


OR2H4P 


H38g74 
1 






OR6 - 

3;dJ80I19.6;hs6Ml-7 






FL 


892 


OR7E52P 


H3 og /4 
2 














893 


0R2InP 


H38g74 
3 








+ 






894 


0R6C1 


H38g74 
4 




Ob 1 b / 






put 




895 


OR7E3 0P 


H38g74 

5 




os i j jy 










896 


ORSBAnP 


H38g74 
6 


DS132 






+ 






897 


OR7H1P 


H38g74 

7 




OST34 U 








FL 


898 


OR5B2 


H38g74 
8 




OSTO / -5 






ves 


FL 


899 


OR5AZnP 


H38g74 
9 












FL 


900 


0R5Bn 


H38g75 
0 










ves 


FL 


901 


0R52Bn 


H38g75 
1 










ves 


FL 


902 


ORSBnP 


H38g75 
2 












FL 


903 


OR52Dn 


H38g75 
3 










ves 


FL 


904 


0R7A11 


H38g75 
4 




Ob 1 1 


rTT-H<?P-8 7ml7 

V_ J. x nor 0 / im. / 






FL 


905 


OR5BnP 


H38g75 
5 












FL 


906 


ORSlAnP 


H38g75 
6 












FL 


907 


OR7A15P 


HJog /d 
7 






CIT-HSP-87ml7 ;0R19- 
l;OR19-134;OR19-146 






FL 


908 


OR7C2 


H38g75 
8 






CIT-HSP-87ml7;ORl9- 
18 




yes 


FL 


909 


OR7E23P 


H38g75 
9 




OST942 


OR21-3 






FL 
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SEQ 
ID # 


Symbol 


HORDE 


Digi 


OST 


Trivial 


Tran 


Int . 


E 


910 C 


>R2E1 F 
C 


I38g76 
) 




I 


iS2 9Kl ;HSNH0bbyi^^ , n 
36M1-9 








911 C 


)R1I1 I 

] 


I38g76 

L 






F2 0569 ;OR19-20 




yes 


FL 


912 C 


)R1RI1P 1 


i38g76 

2 












FL 


913 ( 


DR4F3 1 


*38g76 
3 






AC004908 




ytss> 


FL 


914 < 


DR2AEn ) 


K38g76 
4 










VP Q 

yes 


FL 


915 


OR2InP 


H3 8g76 
5 








+ 






916 


OR52AnP 


H38g76 
6 








+ 






917 


OR7C1 


H38g76 
7 




OST943 


CIT-HSP-146e8;OR19- 
5/TPCR86 




yes 




918 


OR2A3P 


H38g76 
8 






AC004889-B 






FL 


919 


OR7A5 


H38g76 
9 


DS8;DS 
19;DS6 
1;DS68 
;DS112 


OST944 


HTPCR2 


+ 


yes 


FL 


920 


OR2InP 


H38g77 
0 


DS72 






+ 






921 


OR7A10 


H38g77 
1 




OST027 


CIT-HSP-146e8 




yes 


FL 


922 


OR2An 


H38g77 
2 








+ 


put 




923 


OR2M2 


H38g77 
3 




OST423 






put 




924 


OR7A8P 


H38g77 
4 




OST042 


OR19-ll;hg83 






FL 


925 


OR2An 


H38g77 
5 








+ 






926 


OR7E20J 


» H38g77 
6 




OST516 










927 


OR2AnP 


H38g77 
7 








+ 






928 
92 9 


ORSBHnI 
ORlEn 


> H38g77 
8 

H38g77 








+ 


put 





92 
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PCT/US00/27582 



SEQ 
ID # 


Symbol 


HORDE 


Digi 


OST 


Trivial 


Tran 


Int. 


E 






9 














930 < 


ORlEnP 


H38g78 
0 














931 


0R5Bn 


H38g78 
1 










yes 


FT, 


932 


OR8RnP 


H38g78 
2 














933 


OR5ANn 


H38g78 
3 










yes 


r Jj 


934 


ORSANnP 


H38g78 
4 












r Lt 


935 


OR5BRnP 


H38g78 
5 












tJT 

FL 


936 


OR2A1 


H38g78 
6 






AC004889-A 


+ 


yes 


FL 


937 


ORlOAn 


H38g78 
7 










yes 


FL 


938 


OR2A9 


H38g78 
8 


DS149 




HSDJ0798C17 


+ 




FL 


939 


OR2A7 


H38g78 
9 






HSDJ0798C17 


+ 


yes 


FL 


94 0 


OR10A3 


H38g79 
0 






HSHTPCRX12 


+ 


yes 


FL 


941 


ORlOCn 


H38g79 
1 










yes 


FL 


942 


OR7A2P 


H38g79 
2 






OLF4p ; OR1 9-18;hgl003 




yes 


FL 


943 


ORlOWnP 


H38g79 
3 












r Li 


944 


OR7A17 


H38g79 
4 






HSHTPCRX19 




yes 


r U 


945 


OR5Bn 


H38g79 
5 










yes 


r Li 


946 


ORSBnP 


H38g79 
6 












ITT 


947 


OR1Q1 


H38g79 
7 




OST226 


HSTPCR106;OR9- 
A;hRPK-4 65_F_21 


+ 


yes 


FL 


948 


OR2Hn 


H38g79 
8 


DS144; 
DS150 






+ 


yes 


FL 


949 


OR7EnP 


H38g79 












FL 
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SEQ 
ID # 


Symbol 


HORDE 


Digi 


OST 


Trivial 


Tran 


Int. 


E 






9 














950 


DR7A14 


H38g80 
0 




OST945 


OR19-12 








951 


0R1B1 


H38g80 
1 






OR9-B ; hRPK-46 5_F_21 1 




yes 


FT, 


952 


OR12D2 


H38g80 
2 






AC004171;dJ994E9 . 8;h 
S6M1-20 


+ 


yes 


TTT 


953 


OR7EnP 


H38g80 
3 












r J-i 


954 


OR8BnP 


H38g80 
4 












r Lj 


955 


OR1L1 


H38g80 
5 






OR9-C;hRPK- 
465 F_21;hg23 




yes 


FL 


956 


ORllAn 


H38g80 
6 










yes 


FL 


957 


OR7AnP 


H38g80 

7 














958 


OR1C1 


H38g80 
8 






HSTPCR27 


+ 


yes 


FL 


959 


OR1D2 


H38g80 
9 




OST946 


OR17-4 




yes 


FL 


960 


OR1L3 


H38g81 
0 






OR9-D; hRPK- 4 6 5_F_2 1 




yes 


FL 


961 


OR12DnP 


H38g81 
1 












FL 


962 


OR4G1P 


H38g81 
2 






OLB 






FL 


963 


OR2B4P 


H38g81 
3 






AL050339- 

A;dJ974Ill. l;hs6Ml- 
22 








964 


OR11H1 


H38g81 
4 






OR22-1 




yes 


r Li 


965 


OR4 Fn 


H38g81 
5 










yes 


r Li 


966 


OR56AnP 


H38g81 
6 














967 


OR8NnP 


H38g81 
7 












FT. 


968 


OR7EnP 


H38g81 
8 














969 


OR4Pn 


H38g81 










yes 


FL 



94 



WO 01/27158 



PCT/US00/27582 



SEQ I 
ID # 


Symbol 


HORDE 


Digi 


OST 


Trivial 


Tran 


Int. 


E 


970 C 


S 

>R6Cn I 
( 


I38g82 
) | 










put 




971 C 


)R5BCnP I 


138g82 
L J 














972 C 


)R10QnP ] 


■I38g82 1 

2 [ 


DS64 






+ 




FL 


973 C 


DRSBnP ] 


K38g82 
3 












FL 


974 < 


DRIOPnP 


H38g82 
4 1 












FL 


975 < 


DR1L4 


H38g82 
5 [ 




OST046 


OR9 - E ; hRPK- 4 6 5_F_2 1 




ves 


FL 


976 


0R2APnP 


H38g82 

6 1 














977 | 


OR1L6 


H38g82 
7 




OST947 


HShRPK-4 65_F_21 ;hgl6 






FL 


978 


OR6UnP 


H38g82 
8 












FL 


979 


OR5C1 


H38g82 
9 






OR9 - F ; hRPK- 4 6 5_F_2 1 






FL 


980 


ORHInP 


H38g83 
0 












FL 


981 


OR4AnP 


H38g83 
1 












FL 


982 


OR4GnP 


H38g83 
2 












FL 


983 


ORlOVn 


H38g83 
3 










ves 


FL 


984 


OR4G2P 


H38g83 
4 






HS14a-l-B 






FL 


985 


ORlOVnP 


H38g83 
5 














986 


OR4F4 


H38g83 
6 






HS14a-l-A 




ves 


FL 


987 


OR4G3P 


H38g83 
7 






OLC- 7 501 






FL 


988 




) H**8a83 

n j o^j o — » 

8 












FL 


989 


ORlOYnl 


3 H38g83 
9 












FL 



95 



PCT/USOO/27582 

WO 01/27158 



SEQ 
ID # 


Symbol 


HORDE 


Digi 


OST 




Tran 


Int. 


E 


990 C 


)R4GnP I 

( 


138g84 

3 












FL 


991 C 


DRnP 3 


i38g84 
L 














992 < 


}R4Fn 1 


K3 8g84 

2 










yes 


FL 


993 < 


DR8A1 


H38g84 
3 




OST025 






yes 


FL 


994 


0R8Bn 


H38g84 
4 










ves 


FL 


995 


OR6DnP 


H38g84 
5 














996 


OR7E14P 


H38g84 
6 




OST94 8 


OR11-5 






FL 


997 


OR2M4 


H38g84 
7 




OST710 


HSHTPCRX18 




put 




998 


OR4WnP 


H38g84 
8 














999 


OR4Fn 


H38g84 
9 


DS36 










FL 


1000 


OR7EnP 


H38g85 
0 














1001 


OR4GI1P 


H38g85 
1 












FL 


1002 


ORlOJnP 


H38g85 
2 














1003 


OR52En 


H38g85 
3 










yes 


FL 


1004 


OR4RnP 


H38g85 
4 












FL 


1005 


OR4Cn 


H38g85 
5 










ves 


FL 


1006 


OR4AnP 


H38g85 
6 















1007 


OR4AnP 


H38g85 
7 


DS54 






+ 






1 DOR 

1UUO 




H38g85 
8 












FL 


1009 


OR9Gn 


H3 8g8 5 
9 










yes 


FL 



96 



WO 01/27158 



PCT/US00/27582 



SEQ 

id # 


Symbol 


HORDE 


Dig i 


OST 


Trivial 


Tran 


Int. 


E 


1010 c 


)R10An I 

C 


[38g86 

) 










yes 


FL 


1011 c 


}R4Cn J 


138g86 
L 










yes 


FL 


1012 ( 


DRIOVnP 3 


•i38g86 
2 














1013 ( 


DRIOUnP 1 


H38g86 
3 












FL 


1014 


OR7E2P 


H38g86 
4 


DS127 




HP 11-6* ha94 


+ 




FL 


1015 


OR7E35P 


H38g86 
5 




OSTOlo 








FL 


1016 


OR9KnP 


H38g86 
6 














1017 


OR7E13P 


H38g86 
7 




OST94 9 


\Jt\ ii * 






FL 


1018 


OR7EnP 


H38g86 
8 












FL 


1019 


OR9Kn 


H38g86 
9 










yes 


FL 


1020 


ORnP 


H38g87 
0 












FL 


1021 


OR7EnP 


H38g87 
1 




OST9S0 


rvpi i - T -hcrSOO 
uKii J- / iA y ~> u \j 


+ 




FL 


1022 


OR7EnP 


H38g87 
2 












FL 


1023 


OR3A4P 


H38g87 
3 




OST951 


npi 7-94 -OR17 -2 5 


+ 


yes 


FL 


1024 


OR8QnP 


H38g87 
4 














1025 


OR7EnP 


H38g87 
5 












FL 


1026 


OR7EnP 


H38g87 
6 












FL 


1027 


OR3A1 


H38g87 
7 


DS2 




OLFRA03 ;0R17- 
40;hgl38 


+ 


yes 


FL 


1028 


OR5GI1 


H38g87 
8 










yes 


FL 


102S 


) OR5MnP 


H38g87 
9 















97 



WO 01/27158 



PCT/US00/27582 



SEQ 
ID « 


Symbol 


HORDE 


Digi 


OST 


Trivial 


Tira.n 


Tn t- 
J.I1L . 


Ei 


1030 


DR7EnP 


H38g88 
0 












r Lt 


1031 


OR5G1P 


H38g88 
1 




OST952 


ORll- 

104;OR93;OR93Hum 






CT 
r Li 


1032 


OR5PnP 


H38g88 
2 












TTT 

r JU 


1033 


ORlOAEn 
P 


H38g88 
3 














1034 


OR3A2 


H38g88 
4 




OST953 


OR17-228 


+ 


yes 


FL 


1035 


ORlOJn 


H38g88 
5 










yes 


FL 


1036 


OR1D3P 


H38g88 
6 




OST954 


OR17-23 






FL 


1037 


ORlOJn 


H38g88 
7 










yes 


FL 


1038 


OR1D4 


H38g88 
8 






OR17-30 


+ 


yes 


FL 


1039 


ORSGnP 


H38g88 
9 












FL 


1040 


OR4SnP 


H38g89 
0 












FL 


1041 


OR5GnP 


H38g89 
1 












FL 


1042 


OR9HnP 


H38g89 
2 












FL 


1043 


OR1A1 


H38g89 
3 






OR17-7 


+ 


yes 


FL 


1044 


OR1A2 


H38g89 
4 






OR17-6 


+ 


yes 


FL 


1045 


OR8AnP 


H38g89 
5 












FL 


1046 


OR1P1P 


H38g89 
6 






OR17-208 


+ 




r L 


1047 


OR7E12P 


H38g8 9 
7 




OST955 


AC000378-A;OR11- 
3;hgl058 


+ 




r Li 


1048 


OR4A1P 


HJBgo y 
8 






OR11-30 






FL 


1049 


OR10G3 


H38g89 
9 






AE000658-D 




yes 


FL 



98 



WO 01/27158 



PCT/US00/27582 



SEQ 
ID # 


Symbol 


HORDE 


Digi 


OST 


Trivial 


Tran 


Int. 


E 


1050 


OR10G1P 


H38g90 
0 






AE000658-C 






FL 


1051 


OR10G2 


H38g90 
1 






AE000658-B 




yes 


FL 


1052 


OR5Tn 


H38g90 
2 










yes 


FL 


1053 


OR7EnP 


H38g90 
3 












FL 


X \J J ** 


OP7FnP 


4 












FL 




OP4£nP 


5 












FL 


X U D O 




fij oy ju 
6 






HSHTPCRX11 


+ 




FL 


IUj / 


DPI PnP 


7 














±\J DO 




no oy ju 

8 












FL 


1 fi E Q 

IUj J 


OP A Pn D 


9 












FL 


1UOU 


OP1 PnP 


H38g91 
0 












FL 


lUol 


HP r&TTn 


OrrQI 
no oy y ± 

1 










ves 


FL 


1U06 




no oyjA 
2 










yes 


FL 


luo J 


l/At V_li 


H3 8g91 
3 










yes 


FL 


X \J D *t 


OP 1 ^ Dn P 


n j oy 
4 












FL 


1065 

J- V D J 


0R5n 


H38g91 
5 


DSU116 






+ 






1066 




H3 8g91 
6 


DSU150 






+ 






1067 


ORn 


H38g91 
7 


DSU151 






+ 


put 




1068 


ORn 


H38g91 
8 


DSU17 






+ 






1069 


ORn 


H38g91 
9 


DSU18 






+ 







99 



WO 01/27158 



PCT/US00/27582 



SEQ 
ID # 


Symbol 


unonP 
HUKUr- 


Digi 




Trivial 


Tran 


Int. 


E 


1070 


ORil 


0 








+ 






1071 


OR6Fn 


H38g92 
1 


UbU4 1 






+ 






1072 


ORn 


H38g92 
2 


UbU4 7 






+ 






1073 


ORn 


H38g92 
3 


DSU50 






+ 






1074 


ORlOAn 


H38g92 
4 


DSU57 












1075 


ORn 


H38g92 
5 


DSU58 






4. 






1076 


OR2Ln 


H38g92 
6 


DSU59 












1077 


ORlOJn 


H38g92 
7 


DSU60 












1078 


ORlKn 


H38g92 
8 


DSU63 






+ 






1079 


ORlODn 


H38g92 
9 


DSU7 






4. 






1080 


ORn 


H38g93 
0 


DSU32 






4. 






1081 


OR2Ln 


H38g93 
1 


DSU38 






4. 






1082 


ORn 


H38g93 
2 


DSU62 






4. 






1 UH J 


nun 


3 


DSU4 8 






+ 






1084 


OR2n 


H38g93 
4 


DSU111 






+ 







Table 2 



SEQ 
ID # 


Symbol 


D 


C 


Mb 

coord 


CDR 


% 


S 


Acc 


Range 


153 


OR10D3 


0 


11 


137.96 


SDVISV 


69 


M 


AC074177.4 


12106 . . . 
13038 


154 


0R7EnP 


4 


4 


11.58 


MVACGVLDLH 1 1 DS FAL 


53 


R 


AF091580.1 


7 ... 663 


155 


0R1D5 


0 


17 


3.75 


LVVTNLLYLLLLTGI FT 


49 


M 


AF073967.1 


2 ... 649 



100 



WO 01/27158 



PCT/US00/27582 



SEQ 
ID # 


Symbol 


D 


C 


Mb 

coord 


CDR 


% 


S 


Acc 


Range 


156 


ORlONn 
P 


4 


11 


138.02 


LQGSGWHILFGNVLAT 


82 


M 


AC074177.4 


159287 
158526 


157 


OR2F1 


0 


7 


14 8 . 62 


LLGGFTSSVQI IbSLL x 


0 O 


M 

M 


ric U i J? ' 4 . JL 


4 X . . . 

649 


158 


OR7EnP 


7 


4 


IT CO 

11.58 


MAGGLLLDLH 1 Lr ALbJj 




M 

VI 




S47 
1515 


159 


OR8FnP 


6 


11 


137.96 


LLVICEMGAHCVCSNIF 


75 


M 


AC069561.1 
n 


51687 ... 

DU / ^ J 


160 


OR2Q1P 


2 


7 


148.62 


LLCGFSANMEIVSGVIL 


49 


M 


AC020865.3 


190954 
189954 


161 


OR2W1 


0 


6 


33.74 


LMGSCMINVLLVLGIVT 


88 


M 


AF102516.1 


52 ... 
669 


162 


OR7EnP 


7 


4 


11.58 


MVACGVLDLH ITHSFGL 


53 


R 


AF091580.1 


7 ... 663 


163 


OR6B1 


0 


7 


148.62 


LIMCCGI IAKFDLAIFF 


61 


M 


NM_010983. 
1 


178 ... 

9/0 


164 


ORlOKn 


0 


1 


154 .34 


MLG S S AC VVT L I LGAL I 


79 


M 


AC073778.1 


168744 
167803 


165 


ORnP 


13 


11 


138.02 


VPYCIGGHLLICLSLSS 


33 


M 


AC074177.4 


12106 . . . 
13038 


166 


OR4F2P 


; 4 


6 


186.49 


IHGGMVLHFQFVNSICG 


50 


M 


AB030896.1 


1 ... 906 


167 


OR7EnP 


3 


4 


11.58 


MVACGVLDLH I I DSFGL 


54 


M 


AF102536.1 


22 . . . 
669 


168 


OR1F2P 


0 


16 


6.15 


MSADNGVNLHLIEAVTT 


72 


R 


M64377.1 


1 ... 939 


169 


OR2P1P 


7 


6 


33.74 


FGGSCMSNQSALVRXSV 


4 8 


M 


NM UUo7b2. 
1 


i ... yjo 


170 


OR7E43 
P 


« 5 


4 


5.57 


MAGGELFDLHIMPAFGL 


54 


M 


AF102536.1 


22 . . . 

669 


171 


OR4F1 


4 


6 


0.23 


IHGGMVLHFQFVNSICG 


50 


M | 


AB030896.1 


1 ... 906 


172 


OR7E55 
P 


5 


3 


89.94 


MAGDEFLDLHILPAFGL 


53 


M. 


AF073989.1 


547 ... 

1D1D 


173 


OR13Dn 


0 


9 


86.89 



MLGSCWITLQLMTNSLI 


61 


M 


AC023789.5 


371264 
372220 


174 


OR4CnP 


3 


16 




AHGAIVGHIQFVNSICL 


74 


M 


AF102522.1 


40 . . . 


X / 0 


on i nm 

P 


i 

X 


X X 


X J / • Zf \J 


LHGCCGFOFLLGSVMPS 


83 


M 


AC074177 . 4 


128803 
129726 


176 


OR4Cn 


0 


16 




LHGGIVGHVQLVNSICL 


86 


M 


AB030895.1 


1 ... 924 



101 



WO 01/27158 



PCT/US00/27582 



SEQ 
ID « 


Symbol 


D 


C 


Mb 

coord 


CDR 


% 


S 


Acc 


Range 


177 


OR8GnP 


0 


11 


137.96 


LSAICGLGIHFVLSNIM 


73 


M 


AC074177. 4 


106297 
105361 


178 


OR13Cn 
P 


2 


9 


86.85 


MFGACGGNLQLMAS FLG 


82 


M 


AJZd 1 lo4 . 1 


O "7 n "5 

Z / U J ... 

1747 


179 


0R4CnP 


5 


16 




LHEAIVLHIQFINSLCL 


61 


M 


AFlUzozz . 1 


4 U ... 
660 


180 


OR13Cn 


0 


9 


86.81 


MLGTCG I NVQFMAT FIT 


69 


M 


AJ133425. 1 


61 . . . 
1014 


181 


0R4CnP 


0 


16 




LHGGIMGHIQLVNSMCL 


63 


M 


AB030895. 1 


1 ... 924 


182 


OR51Bn 


0 


11 




AHSVSGRSPVRPLITIL 


76 


M 


AF071080. 2 


15931 . . . 
16851 


183 


OR7E5P 


2 


11 


51.76 


MVACDVLDLH I IDS FGL 


54 


M 


AF073989. 1 


547 ... 

1515 


184 


ORl3Cn 


0 


9 


86.77 


MFGSCVSNVQLMSNFLL 


71 


M 


AJ251154. 1 


2703 ... 
1747 


1 Ot 
J. DO 


OR A 
wrvi oil 


n 
\j 






LHGG I AAHLQLVNS I S A 


56 


M 


AB030895. 1 


1 ... 924 


186 


OR51Bn 

o 
r 


4 


11 




VHYPEWRSPPPPLVIFL 


72 


M 


AF071080.2 


15931 ... 
16851 


187 


0R6JnP 


1 


14 


2.72 


CFGTFFGSFPLDLSVIC 


50 


R 


M64378.1 


1 ... 933 


188 


OR51Bn 


0 


11 




S HAI SGRS PIS PQTTVL 


76 


M 


AF071080 . 2 


26330 . . . 
27262 


189 


OR7EnP 


2 


11 


71.8 


MFACGVLDLH I I DSFGL 


55 


M 


AF102536. 1 


22 ... 
669 


190 


0R2An 


0 


6 


144 .32 


T S A V C T T L I H L VGAGLG 


81 


M 


L14566. 1 


62 ... 
667 


191 


OR7E22 
P 


3 


3 


89. 94 


MVACDVLDLH I I DSFGL 


56 


M 


AF073989. 1 


547 . . . 
1515 


192 


OR7E4P 


2 


11 


71.8 


IVACDVLDLHIMHSFGL 


55 


M 


AF102536. 1 


22 ... 
669 


193 


OR7E66 
P 


9 


3 


89. 94 


MAGGELLFLHIMPAFGL 


55 


M 


AF073989. 1 


54 7 ... 
1515 


194 


OR6Mn 


0 


11 


138.18 


TFGTFGGSFPVNLSVIS 


50 


M 


NM 010991. 
1 


1 ... 


195 


0R2ALn 
P 


11 


11 


112.69 


ILGTCASNFDFFNHLLL 


32 


M 


AL359352 . 1 


85325 . . . 
86251 


196 


OR6MnP 


2 


11 


138.18 


TGGTFGGSCPVNLSILT 


50 


M 


NM 010991. 
1 


l ... yjy 


197 


0R4D1 


0 


17 


CLC\ 1 

bU . / 




90 


M 


AC019272 . 4 


62255 . . . 
61317 


198 


OR5D2P 


3 


11 


51.09 


LCVVTTWCTLFTSANES 


48 


M 


AC073947.3 


29192 . . . 
30115 



102 



WO 01/27158 



PCTrtJSOO/27582 



SEQ 
ID # 


Symbol 


D 


c 


Mb 

coord 


CDR 


% 


S 


Acc 


Range 


199 


0R7E38 
P 


7 


7 


95.91 


MAGGELFHLHIMPAFGL 


55 


R 


AF091580.1 


7 ... 663 


200 


OR4D2 


0 


17 


60.7 


I HGGVAGHVQLKNSLDV 


89 


M 


AC019272.4 


183633 
182701 


201 


OR7E7P 


4 


7 


95.91 


MI ACGVLDLHI I DS FGL 


56 


R 


AF091580.1 


7 ... 663 


202 


OR5AHn 
P 


0 


19 


68.97 


RSGIMC 


77 


M 


AC020957.2 


48184 ... 
49107 


203 


OR2U2P 


5 


6 


33.53 


LVYSCIVNIPYTMCIVV 


49 


M 


AC044846.2 


105668 
104736 


204 


OR2U1P 


2 


6 


33.53 


LVCTCMINILCCWIFA 


54 


M 


AF102516.1 


52 . . . 

669 


205 


OR2H2 


0 


6 


33.19 


ILGTCVI EVQS VAS I LV 


89 


M 


AL078630.1 


41097 ... 
40165 


206 


OR2H5P 


7 


6 


33.19 


FLGT C V I EVQSMAS I L V 


84 


M 


AL078630.1 


41097 ... 
40165 


207 


OR2In 


0 


6 


33. 19 


LLGSCASNAQLMARILL 


74 


M 


AL078630.1 


151152 
150391 


208 


ORllHn 
P 


5 


13 




IFNTCLCWIPLCLSVIG 


60 


M 


AF121972.1 


171 ... 
1109 


209 


OR7EnP 


6 






AAACDVIDLHITHSFGL 


56 


M 


AF073964 .1 


41 . . . 
649 


210 


OR9In 


0 


11 


54 .06 


FTAGCGCGLRCI FGVIA 


50 


R 


AF091579.1 


7 ... 663 


211 


OR2AFn 
P 


11 


X 


140.17 


MLGTCGHVTLAGISTLL 


43 


R 


L34074.1 


73 . . . 
1011 


212 


OR13Kn 
P 


5 


X 


140. 17 


MFGMCVIIIHLGIGTLL 


43 


R 


L34074 .1 


73 . . . 
1011 


213 


OR13Cn 


0 


9 


86.77 


MFGSCVSNVQLLSNFLL 


68 


M 


AJ251154.1 


2703 ... 
1747 


214 


OR13Fn 


0 


9 


86.77 


MLGSCGTTVESMISLLM 


55 


M 


AJ133428.1 


61 . . . 
1017 


215 


OR9Qn 


0 


11 


54 .08 


FTGSCGASVRSIFAVIA 


47 


M 


AF146372.1 


509 ... 
1456 


216 


OR2TnP 


1 


1 


254 .77 


I LI G FGG DML VMCCML I 


71 


M 


AF102527.1 


22 . . . 
669 


217 


OR4Kn 


0 


14 


0.08 


IHVGMIVHSHFTNSISS 


56 


M 


AF259072.1 


104176 
105099 


218 


OR2B8P 


c 


6 


31.6 


LLGSCT I NLQLLVS I LV 


62 


R 


L34074.1 


73 . . . 
1011 
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SEQ 
ID # 


Symbol 


D 


C 


Mb 

coord 


CDR 


% 


S 


Acc 


Range 


219 


OR2Tn 


0 


1 


254 .77 


MLAGVALDLLITCCMLT 


57 


M 


AF102527.1 


22 . . . 

669 


220 


OR4Kn 


0 


14 


0.08 


IHTGIAMHSQFMTSIAS 


53 


M 


AF259072.1 


104176 
105099 


221 


OR2A4 


0 


6 


144.76 


TSAVCTTLIHLVGAGLG 


81 


M 


L14566.1 


62 . . . 
667 


222 


OR7EnP 


6 


2 


161.53 


MVACDVLDLHI I DS FGL 


54 


R 


AF091580. 1 


7 ... 663 


223 


OR4Kn 


o 


14 


0.08 


MHGGILVHSQFMTSIAV 


57 


M 


AF259072.1 


104176 
105099 


224 


OR13In 
P 


6 


9 


86.85 


MYGSCVLNNWIGKTLL 


41 


M 


AJ251155.1 


15491 . . . 


225 


OR7EnP 


8 


2 


161.53 


MVACDVLDLHI FFDFGL 


54 


M 


AF073989.1 


547 ... 
1515 


226 


OR6Jn 


0 


14 


2.72 


CFGTFFGSFPLDLSVIC 


50 


R 


M64378.1 


1 ... 933 


227 


OR4Mn 


0 


14 


0.08 


LHGAMLGHIQLMSSISV 


54 


M 


AC019272.4 


183633 
1 oz / UJ. 


228 


OR4VnP 


10 


11 


51.09 


I HG 1 1 VLH FQMVN S FAV 


50 


M 


AB030896.1 


1 ... 906 


229 


OR6Xn 


0 


11 


138.36 


AFGTFSVICQLGATVIG 


46 


M 


AF106007.1 


178 ... 

975 


230 


OR51Gn 


0 


11 


3.7 


LHSSSSRLPLLGWTW 


55 


M 


NM 013617. 
1 


1 ... 921 


231 


OR6EnP 


3 


14 


2.72 


SFGTFCTLI PLGIASLG 


82 


M 


NM_010991. 
1 


1 ... 939 


232 


OR4NnP 


2 


14 


0.08 


LHGGGAGHIQLMNSMTL 


54 


M 


AC019272.4 


62255 ... 


233 


OR6MnP 


7 


11 


138.18 


I FGT FGGARLVSXSMVT 


37 


R 


M64378.1 


1 ... 933 


234 


OR4Nn 


0 


14 


0.08 


LHGGGAGH IQLMNSMTL 


57 


M 


AC019272.4 


62255 . . . 
61317 


C ~> ~J 


OR4Cn 


o 


11 


51 . 09 


LHGGIGGHIQFVNSMCA 


65 


M 


AF102522.1 


40 . . . 
660 


236 


OR4KnP 


4 


14 


0.08 


I HAGMGTHSQFMDSMGT 


51 


M 


AF259072.1 


104176 
105099 


237 


ORnP 


8 


11 


137.59 


AI AI T WVAHAAAG WA 


35 


M 


AC069559.8 


73704 . . . 
74 636 


238 


OR5D3 


0 


11 


51.15 


FCWTAWCTYFISANES 


46 


R 


U50948.1 


34 ... 

978 


239 


OR2G1P 


6 


6 


33.53 


LLGSCVSNIQVLASLLL 


84 


M | 


AL359352.1 


85325 . . . 
86251 
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SEQ 
ID # 


Symbol 


D 


C 1 


coord 


CDR 


% 


S 


Acc 


Range 


240 


0R4Kn 


0 


14 


0.08 


IHTGMIVHSQFINSLSS 


51 


M 


AF259072.1 


104176 
105099 


241 i 


OR8BnP 


2 


11 


137.59 


LCVFSGMGAHNVIVGIV 


68 


M 


AC069559.8 


120212 
119283 


242 


OR2B2 


0 


6 


31.47 


LLGSCASNLQWLISFLI 


89 


R 


L34074.1 


73 . . . 
1011 


243 


OR7EnP 


3 


2 


73.87 


MVACDVLDLRI I DS FGL 


54 


M 


AF073989. 1 


547 ... 
1515 


244 


OR4KnP 


3 


14 


0.08 


IHTGIWHSQFMTSIAI 


57 


M 


AB030896. 1 


1 ... 906 


245 


OR2AD1 
p 


6 


6 


33.87 


FLGACTSS I VLVFGFLV 


51 


M 


AL136158. 1 
4 


162423 
161461 


246 


ORlAAn 
P 


8 


X 


140.17 


MIVDNTIVLHLIIGVII 


48 


M 


AC068902. 1 
1 


144125 
i a*? 


247 


OR1E3P 


1 


17 


2. 99 


MLGVSLLHLHLMMGILI 


74 


R 


M64392.1 


1 ... 942 


248 


OR8BnP 


3 


11 


137.59 


FCVFSGMGAH N I WG I V 


63 


M 


AC069561. 1 

U 


96653 . . . 


249 


OR5Hn 


0 


3 


104.18 


FAGTCFGHIHLVLSIQF 


55 


R 


AF091575. 1 


52 . . . 
663 


250 


OR1G1 


0 


17 


2.99 


LMVMAAMHLHLITGTGI 


56 


R 


M64392.1 


1 ... 942 




no ^ Ut\ p 
\Je\D nnr 




3 


104 . 18 


FAVTCGGHIHFVFSIQF 


46 


M 


AC068904. 1 
5 


165039 
165965 




AD n D 

UrUi t 


5 


x 


140.17 


MLVTCSHHFLSFTGIWS 


36 


R 


U50948. 1 


34 ... 
978 


253 


ORnP 


11 


X 


140. 17 


LIVTFAKITTTQDHHHH 


29 


M 


AC069561. 1 
0 


127636 
126698 


254 


OR4PnP 


2 


11 


51.09 


LHGDIAGHSQLVNSISL 


Ol 






1 ... 924 


255 


OR13Hn 


0 


X 


140. 17 


TLATCTTVAMLITSTLL 


A ~7 


M 




35662 . . . 
36615 


256 


OR7D1P 


5 


19 


11.38 


VMAGTAI FVHLLATLGF 


64 


R 


AF091580. 1 


7 ... 663 


257 


OR4KnP 


2 


18 


47.77 


IHNGIWHSQFMl blAl 


DD 


M 


nDU JUU J \J • X 


1 ... 906 


258 


OR7E24 


1 


19 


11.38 


MVACDLIDLHIIMGFGL 


60 


R 


AF091580. 1 


7 ... 663 


259 


OR51Nn 
P 


2 


11 


3.6 


LHGFSARSPSLGVLVTV 


49 


R 


AF079864 . 1 


Ooc . . . 

1576 


260 


OR7E18 
P 


6 


19 


11.38 


VAGCDLLDLH IMLAFGL 


59 


M 


AF102536. 1 


22 . . . 
669 
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SEQ 


Symbol 


D 


c r 


4b < 
rroord 


:dr 


% 


S 


Acc 


Range 


[ID # 
261 < 


DR7E19 
P 


2 


19 


11.38 


WVCDVLNLHIMDSFGL 


58 


M 


AF073989 . 1 


C A 1 

b4 / ... 
1515 


262 


DR7E41 
P 


7 


11 


14 .36 


IWCDMLDLHIHSTFGL 


55 


M 


AF073989 . 1 


t A 1 

54 / ... 
1515 


263 


0R2R1 


3 


7 


148. 69 


LLGGFVVNMELISSVLV 


77 


M 


AF073974 .1 


41 . . . 
64 9 


264 


OR 10 AC 
nP 


7 


7 


148. 69 


MVGGCGRVGLLLACLLL 


46 


M 


AC073778.1 


168744 
167803 


265 


OR51Ln 


0 


11 


3.79 


LHTFSARVPTLGWTLV 


54 


R 


AF079864 . 1 


632 . . . 
1576 


266 


OR52Jn 
P 


3 


11 


3.79 


MH TGS S RL P I LGVALDA 


57 


M 


AF121979.1 


53 . . . 
1106 


267 


OR9LnP 


9 


8 


45.22 


TVVNNFFFFFFIFDLIA 


37 


M 


AC069561.1 
0 


147203 
146274 


268 


OR51Pn 
P 


4 


11 


3.79 


MHSISARLPALGWSML 


48 


M 


AF071080.2 


2641 . . . 
1697 


269 


OR5HnP 


4 


3 


104 . 18 


FAVTCLGHIHFFFSIQL 


50 


R 


AF091575.1 


52 . . . 

663 


270 


ORSlAn 


0 


11 


3.79 


EHSVSVKLPFTYFGCLV 


48 


R 


AF079864 .1 


632 ... 
1576 


271 


ORSHnP 


6 


3 


104.18 


FAVTCLGHIHFVFSIQF 


46 


M 


AC068904 .1 
5 


165039 
165965 


272 


ORnP 


11 


17 


17.43 


LLPCILSIIALYYYYYY 


27 


M 


AL359352.1 


9138 . . . 
8177 


273 


OR52En 


0 


11 


3.79 


MHTGSARFPFFYCAILF 


57 


M 


AF121979.1 


53 . . . 
1106 


274 


OR5Hn 


0 


3 


104 .18 


F WT CLG H I H F V FA VQ F 


53 


R 


AF091575.1 


52 . . . 
663 


275 . 


OR4CnP 


3 


11 


50.21 


VHRGVVGHIQFVNSICL 


73 


M 


AF102522.1 


40 ... 

660 


276 


OR52En 


0 


11 


3.79 


MHTLSGRFPSLYCANLF 


60 


M 


AF121979.1 


53 ... 
1106 


277 


ORlODn 


c 


11 


138 


LHGCCGIHILLGNVLSI 


86 


M 


AC074177 . 4 


12106 . . . 
13038 


278 


OR5HnP 


2 


3 


104 . 18 


FWTCLGHIHFVFAIQF 


54 


R 


AF091575 . 1 


52 ... 
663 


279 


ORl3An 


c 


) 1C 


4 7.91 




67 


M 


AF102520 . 1 


16 . . . 
669 


280 


OR5HnP 




> 2 


104 .18 


FGGTCLGHIHILLSIQF 


57 


R 


AF091575.1 


52 . . . 

663 
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SEQ 
ID # 


Symbol 


D 


c 


Mb 

coord 


CDR 


% 


S 


Acc 


Range 


281 


OR5Kn 


0 


3 


104.47 


FCETCGAHIHLLFSVQF 


45 


M 


AC069559.8 


36251 ... 
35322 


282 


OR7EnP 


9 


21 


17.99 


MAGGELFHLQIMPAFGL 


57 


M 


AF073989.1 


547 ... 
1515 


283 


OR4DnP 


6 


8 


77.48 


IHGGVAGHVQVMNSLVI 


87 


M 


AC019272.4 


62255 ... 
61317 


284 


OR2ARn 
P 


0 


3 


30.89 




71 


M 


AJ251154.1 


56533 . . . 
57369 


285 


OR7E2 9 
P 


4 


I 3 


136.03 


MAGGELLDLHIMPAFGL 


56 


M 


AF073989.1 


547 ... 
1515 


286 


OR4CnP 


3 


11 


51. 12 


AHGAIVGHIQFVNSICL 


74 


M 


AF102522.1 


40 . . . 

DDU 


287 


OR5PnP 


2 


11 


6. 93 


LVGTCVGNTFCPSSIIV 


74 


M 


AF121977.1 


262 ... 
1 1 y / 


288 


OR7EnP 


5 


3 


136.04 


MVACGVLDLH I IGS FGL 


52 


R 


AF091580.1 


7 ... 663 


289 


OR56An 


0 


11 


; 4 .73 


MNLPSFRLPILQAGLLS 


41 


M 


AF121975.1 


50 . . . 
1012 


290 


OR56An 
P 


9 


11 


4 .73 


KNQAFFRMPILQGGLLS 


73 


M 


AF121981.1 


89 . . . 

475 


291 


OR5Pn 


0 


11 


6.89 


LAATCVAISYSLSSIIV 


63 


M 


AF121977.1 


262 ... 
1197 


292 


OR7E53 
P 


5 


3 


136.04 


MAGGEFPDLHIMPAFGL 


54 


M 


AF073989.1 


547 ... 
1515 


293 


OR5Pn 


0 


11 


6.89 


LVGTCMGNTFCPSSIIA 


83 


M 


AF121977.1 


262 ... ! 
1197 


294 


OR52Ln 


0 


11 


4.73 


MH S S S VRL P FLGMAV I L 


59 


M 


AF121976.2 


474 ... 
1307 


295 


OR5E1 


3 


11 


6.89 


LGATXGYNIQLLFSNLG 


51 


R 


U50948.1 


34 ... 

01 Q 


296 


OR56An 
P 


3 


11 


4.73 


MNLASFRMAILPPPPPP 


39 


M 


AF121976.2 


474 ... 

i jU / 


297 


OR4KnP 


2 


8 


88 . 25 


IHTGMIVHSQFIDS . . . 


D 1 


KA 

M 


ADninQQC 1 

AdUjuo y d . i 


1 QA £ 
1 ... D 


298 


OR52Ln 


0 


11 


4.73 


MHSSSVRLPFLGVAWL 


59 


M 


AF121976.2 


474 ... 


299 


OR7EnP 


1 


4 


74 .82 


MVF 


55 


R 


AF091580.1 


1 ... 663 


300 


OR52Xn 
P 


5 


11 


4.73 


MHSASLXLS FLAVALGG 


51 


M 


AF121976.2 


474 ... 

1307 


301 


ORnP 


13 


4 


74.82 


STGCKGRKXLKLVRDFQ 


24 


R 


M64386.1 


130 ... 
975 


302 


OR56An 


0 


11 


4.73 


MNLTS FRVPVLQAGLLS 


84 


M 


AF121981.1 


89 . . . 
475 



107 



WO 01/27158 



PCT7USO0/27582 



SEQ 
ID # 


Symbol 


D 


C 


Mb 

coord 


CDR 


% 


S 


Acc 


Range 


303 


OR56An 
P 


10 


11 


4 .73 


LI. . . GMMXNL . . . KKK 


60 


M 


AF121981.1 


89 . . . 
475 


304 


OR1R1P 


5 


17 


3 


MVGISAVHLHLIEGWA 


48 


M 


AF073967.1 


2 ... 649 


305 


OR52En 
P 


2 


11 


3.79 


MHTGSGRSPFLYGAILF 


64 


M 


AF121979.1 


53 . . . 
1106 


306 


OR51An 
P 


4 


11 


3.7 


EHTVALKLPLLGAGSTL 


46 


R 


AF079864.1 


632 ... 
1576 


307 


ORSlAn 


0 


11 


3.7 


EHSVSVKLPFTYFGCLV 


48 


R 


AF07 98 64.1 


632 ... 
1576 


308 


OR4CnP 


1 


11 


51. 12 


VHGGVVGHVQFVNS I CL 


75 


M 


AF102522.1 


40 . . . 

660 


309 


OR52Jn 
P 


9 


11 


3.79 


MHTGACRFPI LGVVYLN 


58 


M 


AF121979.1 


53 ... 
1106 


310 


OR4RnP 


9 


11 


51.12 


GGGVXSVNGNYL 


66 


M 


AF102522.1 


40 . . . 

660 


311 


OR52Jn 


0 


11 


3.79 


MHTGACRL PMLGWFVN 


58 


M 


AF121976.2 


474 ... 

1307 


312 


OR4CnP 


3 


11 


51.12 


VHGGGVGHIQFINSICL 


76 


M 


AF102522.1 


40 . . . 
660 


313 


ORSlAn 
P 


2 


11 


3.79 


EHSASAKLPFTYFVTGL 


83 


M 


AF121985.1 


2 ... 478 


314 


OR7EnP 


15 


12 


93.55 


I VVCDLLDLHIHSTFGL 


55 


M 


AF073989.1 


547 ... 
1515 


315 


ORSMnP 


2 


11 


52. 17 


CIVLHVYLMERMVASNQ 


54 


M 


AF102528.1 


52 . . . 
669 


316 


OR10AB 
nP 


1 


11 


6. 93 


MLASCAVFCITILSVLG 


47 


M 


AC073778.1 


168744 


317 


OR52Sn 
P 


2 


11 


3.79 


MHSTSARLPHLSVATGV 


54 


M 


AF121976.2 


474 ... 


318 


OR5Mn 


0 


11 


52 . 14 


C I VH I FYTAAWMLAN FY 


A Q 

4 y 


K 


At u y i o / y . 1 


/ ... DDJ 


319 


ORlOSn 


0 


11 


138.1 


LHASC I I HI HLMS I VAG 


61 


M 


AF259072.1 


32953 ... 
32000 


320 


ORSMnP 


4 


11 


52.14 


C I V H I FYTTAWMLAN FY 


48 


R 


AF091579. 1 


7 ... 663 


321 


ORlOGn 


0 


11 


138.1 


LHGSCGSHVQLI DI VAG 


61 


M 


AF259072.1 


55611 ... 
54658 


322 


ORnP 


20 


11 


29.15 


ILGIYEGSAHYFIILFL 


33 


M 


AL365337. 1 


192661 
191711 


323 


OR5MnP 


2 


11 


52.19 


C I V I YG YSMEWMVANLS 


54 


M 


AF102528. 1 


52 . . . 
669 
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SEQ 
ID # 


Symbol 


D 


C 


Mb 

coord 


CDR 


% 


S 


Acc 


Range 


324 


ORlOGn 
P 


10 


11 


138.1 


LYGSCWGHLPI YVIKFT 


30 


M 


L14567.1 


17 ... 

667 


325 


ORlOTn 
P 


1 


1 


154 .34 


LVACCACTIVLILSVLV 


57 


M 


X92969.1 


8035 ... 
8961 


326 


ORnP 


16 


11 


52.17 


LAAPLLLV FVLAAAAAA 


33 


R 


M64376.1 


1 ... 999 


327 


ORlORn 
P 


11 


1 


154 .5 


MLA V FT I C V FL I GG ALV 


47 


M 


AC023611.2 


108224 
107271 


328 


OR5MnP 


2 


11 


52.16 


C I VH LV Y TMEWMVAN FY 


49 


R. 


AF091579.1 


7 ... 663 


329 


OR7EnP 


4 


8 


6.68 


MLACGVLDLHI I DS FGL 


55 


M 


AF102536.1 


22 . . . 
669 


330 


ORlOTn 


0 


1 


154 .27 


LLACCLTIVALLLSVIV 


58 


M 


AC012302.5 


54283 ... 
55224 


331 


OR1E1 


0 


17 




MT rnCT 7 UT UT TMl^TT T 


83 




Y07557 . 1 


1 ... 942 


332 


ORSBKn 
P 


4 


12 


M O 11 

42.11 


o lCjbAlAiriUr LioL^WVjIj 




M 


AF073965. 1 


2 ... 643 


333 


OR5MnP 


3 


11 


CO IT 

52 . 17 


r'TWU TWVTMIT'IaJMWZIMT TT 

CI VH1 V I IMbWMVAINJLir 


H O 


R 


AF091579 1 


7 ... 663 


334 


OR3A3 


0 


17 


3.06 


LHAGCACNTHALAAMAA 


49 


M 


AF073967.1 


2 ... 649 


335 


OR 10 AD 
nP 


1 


12 


42.11 


TFGVCTFNFLI I DAVI S 


44 


M 


AF247657.1 


1 ... 945 


336 


ORlORn 


0 


1 


154 .5 


MLAICAGATVLICGVLV 


56 


M 


AC073778.1 


168744 
167803 


337 


OR5TnP 


4 


11 


51.94 


MCGTCAAHIHAFFVIEV 


51 


M 


AF121977.1 


262 ... 
1197 


338 


OR4GnP 


15 


7 






42 


M 


AB030892 . 1 


1 . . . 939 


339 


OR6Yn 


0 


1 


154 .5 


LWC YGCTI KFDLAV 1 1 


61 


M 


NM_010983. 
1 


178 ... 

975 


340 


OR1E2 


0 


17 


3. 15 


MLSDSLLHLHLIMGILI 


80 


R 


Y07557.1 


1 ... 942 


341 


OR8Hn 


0 


11 


51.94 


MVGACG INVNWI LATLV 


51 


M 


NM 013728. 
1 


1 ... 948 


342 


OR4 Fn 


0 


7 


0.23 


IHGGMVIHSQFVNSLTC 


50 


M 


AC019272.4 


62255 ... 
61317 


343 


ORlOKn 


0 


1 


154.27 


MLGC SAC VI I LI LC VL I 


83 


M 


AC073778.1 


168744 
167803 


344 


OR7LnP 


11 


X 


140.17 


MLGVCGHGTNLXFFFFI 


32 


M 


ALlooloU . 1 


bj7J^ . . . 
64759 


345 


OR8InP 


7 


11 


51.94 


MWCCMINVSVSLATLG 


44 


R 


M64386.1 


130 ... 
975 
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ID # 


<3 \m\hol 


D 


c b 

c 


lb ( 
:oord 


:dr 


I 


5 i 


hcc 


Range 


I 


)R1 ORn 

> 


0 


1 


154.5 I 


1LAVCTSIVGFIFGVLV 


54 


M 


AC073778.1 


168744 
167803 


347 < 
1 


}R2AFn 

p 


11 


X 


140.17 


N4LGTCGHVTLAGISTLL 


43 


R 


L34074 .1 


73 ... 
1011 


348 < 


3R8Kn 


o 


11 


51.94 


LEIILVYVFLKIFSNLF 


55 


M 


AF102528.1 


52 . . . 
669 


349 < 


DRnP 


7 


10 


127.57 


S . CCCLLTYI IHHHHHH 


31 


M 


AC020958.1 


164590 
163746 




Ur\.o i\iir 


10 


11 


51.94 


MI IILIYQMVKI FSNLF 


35 


M 


AC073945.4 


152209 
153150 


351 1 


OR51Hn 


0 


11 


3.6 


MHGI SSRVPVLGWTLL 


49 


R 


AF079864.1 


632 ... 
1576 


352 


OR7EnP 


5 


3 


136.03 


MVACGVLDLHI I DSFGL 


51 


M 


AF073989.1 


547 ... 

1515 


353 


ORnP 


8 


3 


56.17 




32 


R 


M64376.1 


1 ... 999 


354 


OR5BMn 
P 


20 


3 


103. 93 


KXNKCTLSSSLMVFIQF 


30 


M 


AF146372.1 


509 ... 
1456 


1 c c 


UHlUbn 
P 




11 


138 . 1 


LHGCCGGHFQFTDILAT 


63 


M 


AF259072. 1 


55611 ... 
54658 


356 


OR2Yn 


0 


5 


209.23 


LLGSCAANIQLMARWV 


74 


M 


AC044846.2 


139468 
138536 


*3 C 1 


jUKiuun 
P 


± 


11 


138 . 1 


LHGCCGGHVLLSNWAM 


66 


M 


AC074177.4 


128803 
129726 


358 


OR3BnP 


1 


X 


158.48 


IHAPSILNTYLLSFVAA 


37 


M 


AL136158. 1 
4 


29455 . . . 
30402 


359 


OR8Dn 


0 


11 


138.1 


LCVICAVDIHCIIGNMA 


62 


R 


X80671.1 


203 ... 
1129 


360 


OR5RnP 


0 


11 


52.13 


LLMICVYVFHIIFADMS 


68 


M 


AF102528.1 


52 . . . 
669 


361 


ORlOGn 


c 


11 


138.1 


LHGSCGSHVQLINIVAG 


58 


M 


AF259072.1 


55611 ... 
54658 


362 


ORSBDn 
P 


12 


. 13 


. 53.74 


MTGTCWIHRALSSITP 


39 


M 


NM 013728. 
1 


1 ... 948 


363 


OR5ALn 
P 


1 


. 1] 


l 52. i: 


J VIVVLSYWQALIANTC 


52 


M 


AC073947.3 


29192 . . . 
30115 


364 


OR52Hn 
P 




3 i: 


L 4 . If 


) LHFVSGRVPCLGVPTVi 




M 


AF1 2 197 5 . 1 


50 . . . 
1012 


365 


ORlOGn 


( 


d i: 


L 138.] 


I LHGGCSSHVQLITVVAG 


5€ 


M 


AF259072. 1 


55611 ... 
54658 
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SEQ 
ID # 


Symbol 


D 


C 


Mb 

coord 


CDR ! 


% 


S 


Acc 


Range 


366 


0R5Mn 


0 


11 


52.17 


CIVHIVYTMEWMVANLF 


52 


M 


AF146372 . 1 


509 . . . 
1456 


367 


OR51Mn 


0 


11 


4 .15 


MHSFSIRAPILGWTVL 


50 


M 


NM 013617. 
1 


1 ... 921 


368 


OR6Tn 


0 


11 


138.1 


S FGT FAAWC PLALS VLG 


52 


M 


NM 010991. 
1 


i t\ -i n 

1 ... 939 


369 


OR6DnP 


5 


10 




SLGS FWLGLKALVVLT 


69 


R 


AF034903. 1 


85 ... 
1053 


370 


OR4B1 


0 


11 


45.36 


IHGVIGGHIQWNSFSF 


62 


M 


AF102522 . 1 


40 . . . 
660 


371 


OR5ALn 
P 


4 


11 


52. 13 


VISWGYMIQALIANVC 


50 


M 


AF146372.1 


509 . . . 
1456 


372 


OR51Qn 


0 


11 


4 .15 


FH S FS ACAPS LGLAI I V 


49 


M 


NM 013617. 
1 


1 . . . 921 


373 


OR4 Dn 


0 


11 


138.1 


LHGGIAGHVQLMNNVTM 


63 


M 


AC019272.4 


62255 . . . 
61317 


374 


OR52Nn 


u 


i X 


A 

1 • JO 


MHTf^T.RLPSLGVAIGF 


52 


M 


NM 013619. 
1 


118 ... 
969 


375 


OR4Xn 


A 

u 


1 J- 


S J • JO 


MHftflATGHGOLINGISV 


58 


M 


AB030896. 1 


1 ... 906 


376 


OR8Jn 


0 


11 


52.03 


LL I VVLYTWYVSANVG 


77 


M 


X89682.1 


2 ... 472 


377 


OR51Jn 
P 


2 


11 


4.15 


MHSMSIKLPLLGIVTFL 


46 


M 


AF071080.2 


15931 . . . 
16851 


378 


ORlOGn 


0 


11 


138.1 


LHGSCSSHVQLIDIVAG 


60 


M 


AF259072.1 


55611 ... 
54658 


379 


OR52EI1 


u 


X X 


A ^P, 
H . -J O 


MHTfiTVRLPFLGVI I I D 


66 


M 


AF121979.1 


53 . . . 
1106 


380 


OR4Xn 


0 


11 


45.36 


LHGGIIGHAQLINGLSI 


64 


M 


AB030895.1 


1 ... 924 


381 


ORIOA^ 


: i 
1 


X X 




MF^VCAPWOWAGTWI 


76 


M 


AF247657. 1 


1 ... 945 


382 


OR5Mn 


0 


11 


52.14 


CIVHWYVICWMIANFY 


49 


R 


AF091579.1 


7 ... 663 


383 


OR52En 


0 


11 


4.58 


MHTGSVRFPFLISWGI 


59 


M 


AF121979.1 


53 . . . 
1106 


384 


OR8Kn 


0 


11 


51.94 


LLIGLIYILVKIFADLS 


53 


M 


AF146372.1 


509 ... 
1456 


385 


ORlOAn 


0 


11 


5. 66 


M FG AC AS WQWAAT F I F 


89 


M 


AF247657.1 


1 ... 945 


386 


OR8LnP 


3 


11 


52.13 


LIWMSYVLQLLLANTF 


51 


M 


AF102528 . 1 


52 ... 
669 


387 


ORSBPn 
P 


8 


11 


52 . 82 


VV V VVuljo IVirrvljijnlj 




R 

IN 


U50948 1 


34 ... 

978 


388 


OR52Nn 


0 


11 


4.58 


MHTGSARLPFLGVAIGF 


54 


M 


AF121976.2 


474 ... 

1307 
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SEQ 

t n # 


Symbol 


D 


c 


Mb 

coord 


CDR 


% 


S 


Acc 


Da nno 


"3Q Q 
JO ? 


HRnP 


7 


11 


45. 36 


WWWWWI ALLR . AAAAAK 


28 


M 


X89686.1 


32 . . . 

472 


t on 
J yu 


np P .7n P 

VJt\0 U 1 1 1 


1 


11 


51 . 94 


LLIVILQTTVCVFSNLF 


99 


M 


X89682.1 


2 ... 472 


391 


OR5Mn 


0 


11 


52.24 


CI VI FVYNSQLMVATLS 


50 


R 


AF091579.1 


7 ... 663 


392 


OR52En 


0 


11 


4 .58 


MHTVS I RMPLLGS I LLL 


66 


M 




DO ... 

1106 


393 


OR5Tn 


0 


11 


51.94 


VCGTC AAH I HALFV I E V 


52 


M 


AF1 4 637^ . 1 


CftQ 

1456 


394 


OR52Nn 
P 


5 


11 


4.58 


MHTGSVQLPFLGAAIGF 


51 


M 


NM 013619. 
1 


118 ... 

969 


395 


OR4B2P 


6 


11 


45.36 


I FG 1 1 GRHVQVVNS ELS 


53 


M 


AB030896.1 


1 ... 906 


396 


OR51Kn 
P 


6 


11 


4.15 


MHSCSGKLPLLGIVNFL 


51 


M 


NM 013617. 
1 


1 QO 1 
1 ... i?Z 1 


397 


OR52Qn 
P 


10 


11 


4 .58 


MYTGSVRFPFLFVAVGI 


45 


M 


AF121979. 1 


53 ... 
1106 


398 


OR4 Fn 


0 


15 


86.21 


IHGGMIIHIQFVNSISA 


50 


M 


AF102522. 1 


40 . . . 

c c c\ 

660 


399 


ORllMn 
P 


1 


12 


41. 92 




48 


M 


AL359381.1 


175785 
176720 


400 


OR52Nn 


0 


11 


4 .44 


MHTGSARL P FLGVAI G F 


57 


M 


NM 013619. 
1 


118 . . . 
969 


401 


OR56An 


0 


11 


4.58 


MNLAS FRMPILQGGLLS 


73 


M 


AF121981 . 1 


89 ... 
475 


402 


OR5AWn 
P 


14 


X 




LXADFTSNLPTTSSNW 


39 


R 


X80671 . 1 


20i . . . 
1129 


403 


OR52Nn 


0 


11 


4.51 


MHTGSARLP FLGVAI GF 


55 


M 


AF121976.2 


A 1 A 

4/4 ... 
1307 


404 


ORnP 


15 


X 




ISCIFELTLPLPSNVNV 


31 


M 


AC073947 . 3 


2yiy^ . . ■ 
30115 


405 


OR52En 
P 


6 


11 


4.58 


VHSVSVRMPILGNIILL 


62 


M 


AF121979 . 1 


DJ . . . 

1106 


406 


0R5BHn 
P 


9 


X 




MVASCGGKTVSLCGTLT 


40 


M 


NM 013728. 
1 


1 Q4R 

1 ... y h o 


407 


OR4QnP 


1 


15 


1. 66 


IHGAMAGHMQLMNSLSV 


60 


M 


AC019272 . 4 


OZZDO . . . 

61317 


408 


OR51En 


0 


11 


3.04 


MHSGSARLPLFGVIAIL 


60 


R 


At u / yy 0 4 . 1 


1576 


409 


ORllKn 
P 


o 

z 


1 D 


1 66 

1 . DO 


F^nYGFCITLLITFVFI 


53 


M 


AF121972.1 


171 ... 
1109 


410 


OR12D1 
P 


1 


6 


33.02 


LHGSATIHLHMSTGIAG 


76 


M 


AL133159.4 


16108 . . . 
15185 
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SEQ 
ID # 


Symbol 


D 


C 


Mb 

coord 


CDR 


% 


S 


Acc 


Range 


411 


OR4NnP 


3 


15 


1. 61 


LHGGGAGH IQLMNSMTM 


55 


M 


AC019272.4 


62255 ... 
61317 


412 


OR11A1 


0 


6 


33.02 


FGATCTSVLVLTLSCLI 


76 


M 


AL359381.1 


175785 
176720 


413 


OR10C1 


0 


6 


33.02 


MLGACSCVGH FIAT LIC 


59 


M 


AL365336.1 


122764 
121784 


414 


OR2H1 


0 


6 


33. 02 


LLGTCVMQVQSLbbt V V 


ft ft 

O 0 


M 


AT f)7fl fi"^0 1 

rtijV / O U J U • A 


48786 
47851 


415 


OR9RnP 


8 


12 


59.71 


LAVGGGCNIQFLLS ITT 


54 


R 


AF091579.1 


7 ... 663 


416 


OR4FnP 


0 


7 


0. 53 


VLHFQFVNSICG 




M 


/\D U O U O -7 D * X 


1 ... 906 


417 


OR7D4 


3 


19 


11. 31 


VMAGTAI FVHLLATLGr 




K 


nrn qi c;on i 


7 ... 663 


418 


OR7E25 
P 


3 


19 


11.31 


MIACSVLDLHIVIGFGL 


C 1 


K 


n pn QI ^RO 1 


7 ... 663 


419 


OR2D2 


0 


11 


5. 69 


LLGCCGSWDFITGIL1 


DJ 




nc U / J -JO / .J. 


2 ... 649 


420 


ORlOAn 


0 


11 


5. 69 


MFGVCAPWQWAGTWI 


76 


M 


AF247657.1 


1 ... 945 


421 


OR2WnP 


3 


1 


254.49 


LLGGCVCQGHWVLAWS 


54 


R 


L34074.1 


73 . . . 
1011 


422 


OR7E16 
P 


8 


19 


11.31 


IAGCDLLDLHIMLALGL 


60 


M 


AF102536.1 


22 . . . 
669 


423 


OR52Pn 


0 


11 


4.44 


MHCMSARLPCLGAAVIV 


59 


M 


AF121976.2 


474 ... 

1 ^07 


424 


OR 6 An P 


4 


11 


5. 66 


LLGCCGGIVKLDLAILG 


94 


R 


M64386.1 


130 ... 


425 


OR7D2 


0 


19 


11.24 


VMPITVITLHLIMTLGF 


61 


R 


AF091580.1 


1 ... 663 


426 


OR52Un 
P 


3 


11 


4.44 


LHSASVRFPMLGVAVAY 


52 


M 


AF121976.2 


474 ... 

1 307 


427 


OR2AGn 


0 


11 


5.6 


MLGGDTLSIYYVMGFLP 


55 


M 


AF102527.1 


22 . . . 
669 


428 


OR7G3 


0 


19 


11.24 


I LVGNLVDLHMWTLGV 


64 


R 


AF091580.1 


7 ... 663 


429 


OR56Bn 
P 


3 


11 


4.44 


I HVGS FRFPVLQLAGMS 


41 


M 


AF133300.1 


25713 ... 
26573 


430 


OR2AGn 
P 


1 


11 


5.51 


MLGSDTLIGHYITGFLL 


55 


M 


AF102527. 1 


22 . . . 
669 


431 


OR56Bn 


0 


11 


4.44 


MHVAS FRCSVLQLALMS 


39 


M 


NM 013619. 
1 


118 ... 
969 


432 


OR 6 An P 


5 


11 


5.51 


LLGCCGGIVKLDLAILG 


93 


R 


M64386.1 


130 ... 
975 


433 


OR4FnP 


4 


19 


63.23 


IHGGMVLHFQFVNSICG 


49 


M 


AB030896. 1 


1 ... 906 
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SEQ 
ID # 


Symbol 


D 


C 


Mb 

coord 


CDR 


% 


S 


ACC 


Range 


434 


OR6Wn 


0 


7 


148 .04 


SFGSFAVSSPQDLSFVT 


47 


M 


NM_010991. 
1 


1 ... 939 


435 


OR4Mn 


0 


15 


1.59 


LHGAMLGH I QLMSS I SV 


52 


M 


AF259072.1 


104176 
105099 


436 


OR52Yn 
P 


13 


11 


3.6 


WVWLQWPVMGMAVDF 


29 


M 


AF 1 J J JUU . 1 


H DODl . . . 

47498 


437 


ORllHn 
P 


2 


15 


1.78 


FFGTCLCWI PLCLSVIG 


61 


M 


AF1.21972 . 1 


171 . . . 

1109 


438 


OR 9 An 


0 


7 


148.04 


LSGT FVFS W PALMAI LG 


46 


M 


NM_010991. 
1 


1 ... 939 


439 


OR5Mn 


0 


11 


52.19 


CILLFFYDFQLMSANLS 


50 


M 


AC069563.9 


129775 
130725 


440 


OR6Vn 


0 


7 


148 . 04 


FFGSFAAAPTSDMAFVS 


45 


M 


NM 010991. 
1 


1 ... 939 


441 


OR4Nn 


0 


15 


1. 61 


LHGGGAGHIQLMNSMTL 


53 


M 


AC019272. 4 


62255 . . . 
61317 


442 


OR51An 
P 


4 


11 


3.6 


EHTDSLILPFTGLACMS 


43 


M 


NM 013617. 
1 


1 ... 921 


443 


OR9PnP 


10 


7 


148.04 


FGSNSFEHLVFIHSLLM 


39 


M 


NMJD10983. 
1 


178 ... 
975 


444 


OR4H6P 


3 


15 


1.66 


MHGC I LGH VQLVNS I SG 


59 


M 


AF259072. 1 


104176 
105099 


445 


OR51Fn 
P 


2 


11 


3.6 


MHTFSLRLPLLGDLTTI 


48 


R 


AF079864 . 1 


632 . . . 
1576 


446 


OR7E1P 


3 


11 


68.1 


MVACGVLDLH I I OS FGL 


55 


M 


AF073989. 1 


547 ... 
1515 


447 


OR51Tn 


0 


11 


3.6 


MHSLSVRFPLAGLQLNT 


44 


R 


AF079864 . 1 


632 . . . 
1576 


448 


OR2Vn 


0 


13 


104.15 


IWGGSFDIQVICCMLF 


84 


M 


AF102535. 1 


16 ... 

669 


449 


OR51Hn 
P 


7 


11 


3.6 


MHGGSARAPVLGAVI I L 


51 


R 


AF079864 . 1 


C T o 

632 . . . 
1576 


450 


OR51An 


0 


11 


3.6 


EHTVSIRLPFTGIACTL 


48 


M 


AF071080.2 


26330 . . . 
2 i IbZ 


451 


OR2AIn 
P 


2 


5 


209.13 


YLGSCLSNFHLMARILL 


55 


M 


AC044846.2 


112743 
113748 


452 


OR2F2 


0 


1 


1 A Q 1 A 

1 h o . In 




54 


M 


AF073974 . 1 


41 ... 

64 9 


453 


OR1F12 


0 


6 


31.61 


MMANNAINLHMVTVI FV 


58 


M 


AC023167.7 


60743 
61663 
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ID # 


C \rrn ho 1 




c 


Mb 

coord 


CDR 


% 


S 


Acc 


Range 






o 


19 


11.24 


I LAGSLMDVQMI AS FGI 


60 


R 


AF091580.1 


7 ... 663 


455 


OR7G2 


0 


19 


11.24 


ILAGNLTNLLMIAAFGV 


61 


R 


AF091580.1 


7 ... 663 


456 


0R1M1 


0 


19 


11.24 


MHGISAFITHLIVAVIT 


89 


M 


X89689. 1 


32 ... 
472 


457 


0R51Un 
p 


1 


11 


2.89 




48 


R 


AF079864 . 1 


632 . . . 
1576 


458 


OR52Hn 


0 


11 


4.19 


MHFVSGRIPDLGVPTVS 


59 


M 


AF121975.1 


50 ... 
1012 


ACQ 




V 




vj . J > 


MFVDNGVNLHLI EGVMT 


75 


R 


M64377.1 


1 ... 939 


4 60 


ORlOPn 

D 

r 


0 


16 


87.09 


MIGICTTTTHLVATFII 


48 


M 


AF247657.1 


1 ... 945 


461 


OR4FnP 


4 


19 


7.9 


IHGGMVLHFQFVNSICG 


49 


M 


AB030896.1 


1 ... 906 


462 


0R2T1 


0 


1 


254 .77 


H L VG FGG DLL I MCCML I 


92 


M 


AF102527.1 


22 ... 
669 


463 


OR7EnP 


9 


19 


22.8 


VAGC DLLDLH IMLAFGL 


60 


M 


AF102536.1 


22 ... 
669 


464 


OR51Gn 


0 


11 


3.6 


LHSFSVRLPLMGVITVI 


57 


M 


NM 013617. 
1 


1 ... 921 


465 


OR2Tn 


0 


1 


254.77 


MVAGFGLDTFIMCCMLI 


67 


M 


AF102527.1 


22 ... 
669 


466 


0R5BGn 
P 


2 


11 


51.27 


AAAAAGG S I HNL FAVE I 


52 


R 


U50948.1 


34 ... 
978 


467 


OR5WnP 


3 


11 


51.27 


MGADCLVDI HCMFWAC 


51 


M 


AF146372.1 


509 . . . 
1456 


468 


OR51Sn 


0 


11 


3.6 


MHSVSARLPLLLVLMGD 


42 


M 


AF071080.2 


26330 . . . 
27262 


469 


OR5WnP 


1 


11 


51.27 


LVFIES 


55 


M 


AC074177.4 


107189 
107708 


470 


OR51An 

■n 
r 


3 


11 


3.6 


EHTDSLILLPTGVAMMD 


46 


M 


NM 013617. 
1 


1 ... 921 


471 


OR5Dn 


0 


11 


51.21 


FCG VTGWC I L FC I AN E S 


46 


M 


AF146372.1 


509 ... 
1456 


472 


0R7EnP 


4 


4 


5.55 


MVACGVLDLHI I DS FGL 


54 


R 


AF091580. 1 


7 ... 663 


473 


OR51Fn 


0 


11 


3.6 


MHTFSSRVPVFGALTTF 


53 


R 


AF079864 . 1 


632 . . . 
1576 


474 


OR5Dn 


0 


11 


51.21 


YCVVSGWGVLYLFANEC 


48 


M 


NM 013728. 
1 


1 ... 948 


475 


OR52Rn 


! o 


11 


3 . 6 


wit np P T ni.in nwr\ 7 7\ 'W A IT 


D O 


M 


AF121976 2 


474 . . . 

1307 


476 


ORnP 


27 


11 


51.21 


FCFAAGQSPGFLCFFFF 


23 


M 


AB030893.1 


37 . . . 
930 
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SEQ 
ID # 


Symbol 


D 


c t 

< 


zoord 


ZDK 


% 


S 


Acc 


Range 


477 ( 


Z)R7EnP 


6 


3 


121.47 


MVAC DVLDLH 1 1 DS FSL 


57 


M 


AF073989 . 1 


04 / ... 

1515 


478 


OR6Qn 


0 


11 


54 .04 


LTGACAVTLPLDVSVLA 


52 


M 


NM 010983. 
1 


178 .. . 

975 


479 


OR4Fn 


0 


6 


185.89 


I HGGMVLHFQFVNS ICG 


51 


M 


AB030896.1 


1 ... 906 


480 


OR7EnP 


3 


13 


40.31 


FFSP . AAALHIMPAFGL 


65 


M 


X89686. 1 


5c ... 
472 


481 


OR7En 


0 


2 


95.17 


MVAC DVLDLH I I DSFGL 


57 


M 


AF073989. 1 


547 . . . 
1515 


482 


OR4Nn 


0 


14 


0.27 


LHGAMVGHVQLMNSLSL 


58 


M 


AC019272. 4 


62255 - - . 
61317 


483 


OR2ASn 
P 


7 


1 


254.77 


GGGGGMICGLLP 


43 


M 


AF102535. 1 


16 ... 
669 


484 


ORllHn 


0 


14 


0.33 


FFGTCFIGIPYFQSVLF 


90 


M 


AF121972. 1 


171 . . . 
1109 


485 


OR2Tn 


0 


1 


254.77 


MLAG FGLDML I MCCML I 


69 


M 


AF102527. 1 


22 ... 
669 


486 


OR2TnP 


1 


1 


254.77 


CMMGFSGDLLIMCCMLI 


77 


M 


AF102527. 1 


22 ... 

669 


487 


OR2AKn 
P 


3 


1 


254.55 


TLGGACSNIHYVSGILL 


50 


M 


AF102533. 1 


16 . . . 

669 


488 


ORnP 


16 


12 


4.38 


VLKSKCWQLPFYMPLLM 


25 


R 


Y07557.1 


1 ... 942 


489 


OR5DnP 


4 


11 


51.21 


FCAVTGWSTLFCIANES 


48 


R 


U50948.1 


34 ... 
978 


490 


OR7EnP 


1 


4 


5.55 


FVACDVLDLHI I DNFGL 


54 


M 


AF102536. 1 


22 ... 
669 


491 


OR5L2 


0 


11 


51.27 


FCGWCCC I HLLVANE V 


53 


M 


AF146372. 1 


509 . . . 
1456 


492 


OR5Dn 


0 


11 


51.27 


FCWLWCTLSLVANES 


48 


M 


NM 013728. 
1 


1 ... 948 


493 


ORnP 


4 


9 


81.99 


. . CCCLFFQSIASGTYI 


23 


M 


AL359381. 1 


82137 . . . 
8154 4 


494 


ORlOQn 


C 


11 


54.08 


MVGSCGLPQLLLVSVLI 


50 


M 


AL365336. 1 


123248 
124093 


495 


OR9MnP 


1 


11 


51.27 


LCVDSGGSIHNLFAVEI 


54 


M 


AC069559. 8 


737Q4 . . . 
74636 


496 


OR7E62 
P 


c 


) 2 


73.96 


MAACDVLDLHTI DS FRL 


56 


M 


AF073989.1 


547 ... 
1515 


497 


OR9LnP 


i: 


\ 11 


54.06 


> MFVGCTLVAYGILTMIA 


32 


M 


AC069561.1 
0 


147203 
146274 
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SEQ 
ID # 


Symbol 


D 


c 


Mb 

coord 


CDR 


% 


S 


Acc 


Range 


498 


OR7E4 6 
P 


10 


2 


73.96 


MAGVEFCDLHIMPAFGL 


54 


M 


AF102536.1 


22 . . . 
669 


499 


OR1S1 


0 


11 


54 .08 


MIVVNILITHLLVGVIF 


56 


M 


AC073769.1 


133488 
132556 


500 


OR5DnP 


0 


11 


51.21 


FCVIMGWCTLSCISSEC 


45 


M 


AC069563.9 


111696 
112671 


501 


OR9InP 


4 


11 


54 .06 


FTASCGGNICCISAVIT 


46 


R 


AF091579.1 


7 ... 663 


502 


OR5Dn 


0 


11 


51.21 


FCWSGWCELSLLANES 


53 


M 


AF146372.1 


509 ... 
1456 


503 


OR9QnP 


4 


11 


54 .08 


FTASCGAS VRT I FAVMA 


47 


M 


AL365337.1 


192661 
191711 


504 


OR51Cn 
P 


0 


11 


3.04 


MKTVSARMPMLGAMTW 


51 


R 


AF079864 .1 


632 ... 
1576 


505 


OR5WnP 


1 


11 


51.27 


FCADCGVDIHL 


53 


M 


AC069561.1 
0 


127636 
126698 


506 


OR9InP 


2 


11 


54.06 


FTAGC S CGLHC I CAM FA 


46 


M 


AC074177.4 


106297 
105361 


507 


OR51An 
P 


4 


11 


3.04 


MHSVSARVPVPGWTGL 


72 


M 


X89685.1 


2 ... 481 


508 


OR5L1 


0 


11 


51.21 


FC WVCCC I HLLVANEV 


55 


M 


AF146372.1 


509 ... 
1456 


509 


OR7EnP 


5 


13 


50.42 


WDLHIMPAFGL 


66 


M 


X89686.1 


32 . . . 
472 


510 


OR5BLn 
P 


18 


11 


54.08 


I LGNXLENQCFI FAMI T 


29 


R 


M64392.1 


1 ... 942 


511 


OR51En 


0 


11 


3.04 


MHSASVRFPLLGAIVMV 


95 


R 


AF079864.1 


632 ... 
1576 


512 


OR51Dn 


0 


11 


3.04 


MHSASSRFPLIGIIVMV 


61 


R 


AF07 98 6.4.1 


632 ... 
1576 


513 


OR52In 


0 


11 


3.04 


MHTATARFPLMSGSMVS 


46 


M 


AF121975.1 


50 . . . 
1012 


Dl 4 






X 0 




IHTGMIVHSOFIDSLSS 


56 


M 


AB030896. 1 


1 ... 906 


515 


OR52In 


0 


11 


2.99 


MHTATARAPLMSGSMVS 


47 


M 


AF121975.1 


50 . . . 
1012 


516 


OR4KnP 


2 


18 


19.04 


IHNGIVVHSQFMTSIAI 


55 


M 


AB030896. 1 


1 ... 906 


517 


OR52Mn 
P 


1 


11 


3.04 


MHATS VRYLP I G I GVLL 


51 


R 


AF079864.1 


632 ... 

1576 
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SEQ 
ID I 


Symbol 


D 


C 


Mb 

coord 


CDR 


% 


S 


Acc 


Range 


518 


ORnP 


7 


6 


31.58 


FLVSCLLLLLLLEGIHW 


30 


M 


AF073964 .1 


41 . . . 
649 


519 


ORnP 


9 


8 


88 .25 


IXWVLNIVNMTTIIFL 


24 


M 


AC074177.4 


149899 
148964 


520 


ORnP 


9 


10 


70. 63 


YSIVMFYHAHFICELLN 


26 


M 


AC068902.1 
1 


144125 
143193 


521 


ORnP 


9 


9 


70 . 7 


WWWWWSWYGNFDDSITX 


26 


R 


AF091563 . 1 


7 ... 669 


522 


ORnP 


9 


5 


202.43 


FFFFF.PPPPP 


27 


R 


AF034902.1 


4197 ... 
5177 


J 4- -> 


DRn P 


1 0 


\\ 


137 .77 


T.T.T.T.W^OFXOFLAWVV 


29 


R 


M64376 1 


1 ... 999 


524 


ORnP 


3 


11 


16.31 


NNNNNLLXMNILTLLAI 


27 


M 


AL136158.1 
4 


29455 ... 
30402 


JZ J 




1 7 


1 1 

A. x 


S S 6 


T.AHNNT YCYHM LLLL 


2 6 


R 


M64377 1 


1 ... 939 




\J L\\J ill 


n 




1 6 

1 J 1 * . u 


T,T ArrA^^MKFDLAMTT, 

LiJ. A^UAOJlirvr UJjniJllj 


60 


M 


NM 010983 

1 


178 
975 


^97 




•a 
■j 








54 


R 


AF091580 1 


7 ... 663 


528 


ORnP 


12 


11 


138.51 


LMCHS . FFFFFMMMMMM 


29 


R 


AF091573. 1 


7 ... 663 


529 


OR7EnP 


5 


14 


33.48 


MAGGDFLDLYILPDFGL 


55 


M 


AF073989.1 


547 ... 
1515 | 


530 


ORnP 


7 


10 


127.4 


S.CCCLLTYIIHHHHHH 


31 


M 


AC020958. 1 


164590 
163746 


531 


ORlOXn 
P 


2 


1 


154 . 6 


MLGGCSAITELIISGLG 


49 


M 


AC073778.1 


168744 
167803 


532 


ORlOZn 


0 


1 


154 .71 


MAACCTTFGMVILSVLV 


56 


M 


AC025913.3 


108128 
109067 


533 


OR6KnP 


2 


1 


154 .73 


MYG I VGCT PEWWHE I T 


40 


R 


M64386.1 


130 ... 
975 


534 


OR6Kn 


0 


1 


154 .73 


MHGIVSCTPEWVIHEIT 


44 


M 


AC027184.3 


54955 . . . 
54017 


535 


ORlFnP 


1 


4 


97.57 


IEGVMT 


73 


R 


M64377.1 


1 ... 939 


536 


ORlABn 
P 


3 


19 


19.44 


MIGISAFNTHLV 


64 


M 


AC073769. 1 


133488 
132556 


537 


0R52Mn 
P 


1 


11 


2.89 


MHATSARYLPIGIGVLL 


49 1 




ZVF121975.1 


50 . . . 
1012 


538 1 


ORlXnP 


6 


5 


202.43 


MIANTLGIVHIFAALFA 


71 I 




<VF102530.1 


1 ... 666 
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SEQ 
ID « 


Symbol 


D 


C 

c 


lb c 
:oord 




I i 


5 ; 


\cc I 


*ange 


539 C 


)R4FnP 


8 


16 


83.04 \ 




46 I 


4 1 


\C019272.4 ( 


52255 ... 
51317 


540 C 
I 


)R52Mn 


5 


11 


2.891 




45 


R i 


^F079864.1 


632 ... 
1576 


541 ( 


3R2Vn 


0 


5 


209 . 61 




83 




AF102535.1 


16 . . . 
669 


542 ( 


DR2V1P 


4 


5 


209. 61 


t t n iff* C TTHT HZ^T PPMT.T. 


90 


M 


AF102537.1 


16 . . . 

669 


543 


DR2Zn 


0 


19 


65. 55 


I TG V G S VN 1 \J x -Lj ^ « 1 


76 


M 


AC073769. 1 


54319 . . . 
55289 


544 


OR52Kn 
P 


5 


11 


2.89 


AMFIEL 


52 


M 


AF121975.1 


50 . . . 
1012 


545 


ORlOHn 


0 


19 


19.7 


MFGFSWGMMVlbJ-iV I Al 


7 5 


M 


AC023604 .2 


214343 
213396 


546 


OR2Dn 


0 


11 


5.77 


ILGCCRSVVDFIMGILA 


85 


M 


AF073987.1 


2 . . . 649 


547 


OR7EnP 


6 


2 


161.49 


WGGCSSDLHIMrAr bL. 




M 


X89686. 1 


32 . . . 
472 


548 


ORllGn 
P 


4 


14 


0.27 


FFGSCSLWIPVbLbbJ-il 


68 


M 


AC027184 .3 


54955 . . . 
54017 


549 


ORnP 


12 


14 


0.27 


GSCGNSLHniLMVNX XX. 


28 


M 


AF121972 . 1 


171 ... 
1109 


550 


ORllGn 


0 


14 


0.33 


FFGSCNLWIPNr Lbrvn 


o / 


M 


AF121972 . 1 


171 ... 
1109 


551 


ORllHn 
P 


5 


14 


0.33 


FTGTAFFSVSQc LbX xx» 


68 


M 


AF121972.1 


171 ... 
1109 


552 


OR6Kn 


0 


1 


154 .73 


MHENGGFIPbMUnAl xx 


4 6 


R 


AF034897.1 


354 ... 
1199 


553 


ORllHn 


0 


14 


0.33 


FFGTCVGCVPLCFN I I G 


71 


M 


AF121972.1 


171 ... 
1109 


554 


OR6KnP 


c 


1 


154. 12 


MHGNGGFVPEWDHAAIF 


46 


M 


AL365336.1 


122764 
121784 


555 


ORllHn 
p 


2 


! 14 


I 0.32 


1 FpGTCLIblor r vor xjj 


70 


M 


AF121972.1 


171 . . . 
1109 


556 


OR6KnP 




> ] 


i 154. s: 


I MHG VAG FM P EC DRAS I T 


43 


M 


AC027184 .3 


54955 ... 
54017 


557 


OR6Kn 


( 


) : 


L 154.8' 


\ MHGISGCLPEWVIHEIA 


45 


R 


AF034900. 1 


1 ... 963 


558 


OR2Ln 


( 


) : 


L 254.55 


5 SSGGAGINAHYVSTFLF 


52 


\ M 


AF102527 . 1 


Cc. . . - 

669 


559 


OR4GnP 


1 


3 i 


5 83.0' 


1 ICRKMAVHSQFVNSISA 


45 


> M 


AB030892.1 


1 ... 939 
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SEQ 
ID # 


Symbol 


D 


C 


Mb 

coord 


CDR 


4 
Z 


c 


& c 
nut 


x\u u y c 


560 


OR6Nn 


0 


1 


154 . 84 


IHGACGGGVELDINK1 A 


DV 


D 

r\ 


no 4 o \j • x 


130 
975 


561 


OR2LnP 


2 


1 


254 . 55 


r> t t\ i rp /"* T XI 7\ LI V W 




M 




1 6 
669 


562 


OR9A1 


0 


7 


146. 91 


LLGTLVLSWPALMAI lb 




M 


t i 4 c an i 

L14 JO f • X 


1 7 
667 


563 


OR6Nn 


0 


1 


155. 69 


THGACACCSELDINIII 


51 


M 


AL136158.1 


29455 . . . 


564 


ORlOHn 


0 


19 




MFG FSCGMVVAGLVTAL 


86 


M 


AC023604 .2 


245345 
246298 


565 


OR7EnP 


4 


9 


71.72 


MVACDVLDLH IMNS FGL 


57 


M 


At u / j y o y . 1 


Oh 1 ... 

1515 


566 


OR2AQn 
p 


5 


1 


155. 69 


FCHSCLLLLSLLPFFFF 


31 


M 


AL359352 . 1 


0 3 0 0 0 ... 

56546 


567 


OR2LnP 


3 


1 


254.55 


S MAG AG I N AH YVS S FL F 


50 


M 


AF102537 .1 


16 . . . 
669 


568 


OR5ARn 


0 


11 


52.46 


FWDCGASAHLLLCIES 


53 


R 


AF091579.1 


7 ... 663 


569 


OR7EnP 


4 


9 


71.79 


TAGGETLDLHIMPAFGL 


57 


M 


AF102536 . 1 


11 ... 
669 


570 


OR10AA 
nP 


2 


1 


155. 69 


THGMCAAAVPLHVIATC 


84 


M 


AC005 992 . 1 
5 


y j. ± q ... 
8173 


571 


ORlOJn 
p 


4 


1 


157.7 


MIAICGVVVQSNVSVIV 


72 


M 


X92969.1 


8035 . . . 
8961 


572 


OR5A1P 


0 


11 


55.81 


FVGLCGGSIQSNVVVGT 


81 


M 


Y15525.1 


1 ... 705 


573 


OR2AHn 
P 


5 


11 


52.4 6 


MLGSCISSVILVFSIVI 


51 


M 


AF24 7 657 . 1 


j. ... y^o 


574 


ORlOJn 
p 


4 


1 


157.7 


LLGICGIMVQSNVSVLL 


68 


M 


X9296y . 1 


oUJo . . . 
8961 


575 


OR5 6Bn 
P 


2 


11 


4.93 


IHMCSSRLPVLQLVWS 


39 


M 


AF121975.1 


50 . . . 
1012 


576 


OR5M1 


0 


11 


52.35 


CIVIFIYSSQLMVANLS 


49 


R 


AF091579.1 


7 ... 663 


577 


OR52Wn 
P 


0 


11 


4.93 


MHTASLLAVPLGLS I SM 


48 


M 


AF121976.2 


474 ... 

X JU / 


578 


OR5AMn 
P 


5 


11 


52.35 


FI VI YAYNVQLMVANLC 


35 


M 


AC068904.1 

c 
O 


113793 
114719 


d i y 


P 


j 


11 


4 . 93 


MH FVSTQT PVLG VPS W 


89 


M 


AF121975.1 


50 . . . 

1012 


580 


OR5MnP 


1 


11 


52.35 


CVLLYFWVMQLLSANLV 


48 


R 


X80671. 1 


203 ... 
1129 
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SEQ 
ID # 


Symbol 


D 


c 


coord 


CDR 


a 


c 




Range 


581 


0R5APn 
P 


6 


11 


52 . 35 


FGAGGALNlHt I r ANhb 


OO 


p 


X80671 1 

AU u w ' J. • X 


203 . . . 
1129 


582 


OR56Bn 


0 


11 


4 . 95 


IHFCot KLrVJjUJ-LftJj v o 


A 1 


M 

ri 


AF121975 1 


50 . . . 
1012 


583 


OR5APn 


0 


11 


52 . 35 


FGLGCTANI HM1 t bl Vb 


O O 


M 


AF121977 1 


262 . . . 
1197 


584 


OR52Bn 


0 


11 


4 .93 


GHFVSARI PVLGVPMVL 


73 


M 


AF121975.1 


50 . . . 

1012 


585 


OR9Gn 


0 


11 


52.5 


FAAYCVGN I I KMLLNVC 


45 


M 


AC074177.4 


106297 
105361 


586 


OR52Kn 


0 


11 


2.86 


MH S I S ARL PLLGVAS VL 


53 


M 


NM_013619. 

1 


118 ... 
y oy 


587 


OR5MnP 


1 


11 


52.35 


FIVIYAYNSQLMVANLC 


51 


M 


AC074177.4 


106297 
105361 


588 


OR52Kn 


0 


11 


2.86 


MHSISARLPLLGVAIVL 


52 


M 


1 


1 1 p 

llo ... 
969 


589 


OR52Kn 
P 


3 


11 


2.82 


MHS I SARLPLLGVAIGL 


53 


M 


NM oiJoiy. 
1 


1 lo ... 

969 


590 


OR52Bn 
P 


4 


11 


2.78 


IHFI S ARVP DLGVLTVL 


57 


M 


Ar Xz. iy /o.J. 


ou . . . 
1012 


591 


OR2B6P 


0 


6 


31.62 


LLGAYATNWLLLVS FHI 


7 9 


R 


L J 4 U / H . 1 


/ 3 ... 

1011 


592 


OR2WnP 


7 


6 


31.61 


LLRGCASNVMLAFAIVL 


58 


M 


AFlUzolb . 1 


DZ, . . . 
669 


593 


OR2AnP 


5 


7 


148.83 


TMAHCTCLVHLISSILG 


"7 O 

/Z 


M 


At 1 UZDZl . 1 


9 9 

669 


594 


ORnP 


16 


6 


31.61 


FLVSCMDFMYIVLNNVI 


39 


M 


At 1 UZd X b . 1 


D £ ... 

669 


595 


OR2LnP 


0 


1 


254.55 


STAVAGINAHYVSAFLF 


C A 

50 


M 


At 1UZ0Z / . 1 


9 9 

Z Z ... 

669 


596 


OR2W2P 


5 


6 


31.61 


LLGGC VCQS YWVLS I VM 




K 


T ^ a m A 1 


/ ... 

1011 


597 


OR2LnP 


1 


1 


254.55 




61 


M 


AF102535.1 


16 . . . 

DO? 


598 


OR2B7P 


1 


6 


31.61 


LLGGCTTNIQLIVSFLV 


59 


M 


AC044846.2 


105668 
104736 


CQQ 

oy y 




n 
u 


I 


254 . 43 


SLGGAGINAHYVSAFLF 


53 


M 


AF102527.1 


22 . . . 
669 


600 


OR5BFn 


C 


1 


254.77 


WV YLAS YMH S I SAVGG 


46 


M 


AL359352.1 


9138 ... 
8177 
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ID # 




D 


c 


Mb 

coord 


CDR 


% 


S 


Acc 


Range 


601 


OR2LnP 


4 


1 


254.55 


SVAGMSMDAHYVSTFLF 


47 


M 


AF102527.1 


22 . . . 
669 




OR7FnP 


3 


10 


17.14 


MVACCVLDLHI 


51 


R 


AF091580.1 


7 ... 663 


603 


OR1H1 


2 


9 


106.04 


LGADNVI HVHLLVALLA 


57 


M 


AC073769.1 


133488 
132556 


604 


ORnP 


14 


1 


254 .49 


TTTKKSERIYIVSSFLI 


24 


M 


AF102527.1 


22 ... 
669 


605 


OR4 Dn 


0 


11 


55.81 


I H GG I AS H I QLMNNVTL 


64 


M 


AC019272.4 


183633 
182701 


606 


ORILn 


0 


9 


106.04 


MYGNSFFHLHLQEAVLT 


54 


M 


AC023167.7 


60743 . . . 
61663 


C C\ "7 

bU / 




n 
u 


i 




VT FAYGGVGLSS 


47 


M 


AL136158 . 1 
4 


154973 
155908 


bUo 


UKDAH 


n 
u 


1 1 


77 




64 


M 


Y15525. 1 


1 . . . 705 


609 


OR5AYn 


0 


1 


254 .2 


LVAG I LNLLYGS I G YAS 


50 


M 


AL359352.1 


126933 
127889 


blU 


OKI Jbn 


n 
U 


1 
x 




T TT ftMMTNVHT.VADLAG 


59 


M 


AF102540 . 1 


16 . . . 
669 


bl 1 


An C DDr> 

P 


u 


1 x 


^ ^ 7 7 


YA^T.rftCI^VHPLEAVGG 


54 


M 


Y15525 . 1 


1 ... 705 


biz 


UKybnr 


D 


1 1 


DZ . H -/ 


FVywrAfiNT TET.MTiNTT 

C V Alx^.rtUll ± X HiUi JDll X i. 


47 


M 


AF121977 . 1 


262 ... 
1197 


613 


OR2TnP 


4 


1 


254 .77 


HLAGFAGNLLVMCCMLI 


75 


M 


AF102527.1 


669 


614 


ORnP 


7 


1 


255.42 


PVAGKGAFLHSVESLGS 


38 


M 


AL365337. 1 


192661 
191711 


615 


ORUn 


0 


9 


95.9 


MITDSVLSSHLMVGVIL 


66 


M 


AF102524. 1 


52 . . . 
669 


DID 




i 

X 


J. u 


6.47 


LLG AC I G N I Q FL VC FT V 


85 


M 


M84005. 1 


1 ... 936 


DJL / 


HD Ofn P 

UKrbn r 




1 1 

X J. 




FAAYCYGNILNLLLNVS 


49 


M 


AL365337.1 


192661 
191711 


618 


OR2C1 


0 


16 


6.4 


LLGACIGN I QFLVC FT V 


85 


M 


M84005. 1 


1 ... 936 


619 


0R51An 
P 


2 


11 


4.22 




52 


M 


AF071080.2 


26330 . . . 
27262 




620 


0R9Gn 


0 


11 


52.49 


LCAYCGGNAHNLVVTVS 


53 


M 


AC068904.1 
5 


165039 
165965 



122 



WO 01/27158 
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crn 
ID # 


C \mihn 1 
o yiiuju ± 


n 
u 


c 


Mb 

coord 


CDR 


% 


s 


Acc 


Ranae 


621 


OR52Bn 


0 


11 


2.78 


LHFISTRTPILGILTVL 


61 


M 


AF121975.1 


50 . . . 
1012 


622 


0R1K1 


0 


9 


105.89 


MFGVSMVHLYLI EGVVT 


58 


R 


M64377.1 


1 ... 939 


623 


v[\J 1 XVI 1 
P 




i 1 

X X 


2.78 


MHTYSARLPGLGSISLL 


47 


R 


AF079864 1 


632 . . . 
1576 


694 


OR7FnP 






S4 R ? 


MVArnVLDLHILDSFGL 


57 


M 


AF073989 1 


547 
1515 


69 


vja d z rn 
P 


O 


1 1 
1 1 


9 R 9 


nil O/AOrVIMj t LiXjU/vi v v 1 


•j *j 


M 

Li 


API 9 1 Q7 S 1 

nE 1l 1^7 / J ■ i. 


•J V ... 

1012 


69£ 


\Jt\ 1 LiiiC 


c 


q 




MvarnvnYVH^MD^FnT. 

livrt^L^vv^i v nji.iL'iji ui-i 


4 ft 


M 




99 

669 


627 


OR7EnP 


5 


9 


70.7 


TAGGD.CCCCC 


43 


M 


AF073989. 1 


547 ... 

1 D 1 D 


bZ O 




1 


0 1 
Z 1 


0 . 1Z 


TUTrMTUHQOrTncJT c?c 
i.nluMi vnoyt x uo wo 


R7 


M 

1X1 


np9^Qn79 1 
r\C ZD y\) 1 Z . X 


105099 


629 


OR4KnP 


2 


21 


8.12 


IHNGI WHSQFMTSTAT 


54 


M 


AB030896.1 


1 ... 906 


con 

bDU 


UK / Lnr 


D 


A 

y 


"7 r\ *7 
/ U . / 


17TTT VU Q\7P7A TTPT 


DO 


M 


AOjOOO. 1 


^9 

472 




ukd l in 


A 

U 


1 1 


4 . Id 


w it C E > CPk rr PDIT r \7r , X/'T TVM 


D 1 


K 


7\rfi7 QQ CA 1 
At U / 370b4 . J. 


boz . . . 
1576 


b dz 


An CI Tn 

uko i in 


A 
U 


1 1 


/ 1 c; 
4 . ID 


MHbMobK 1 ^IjIjVj V ij ltM 


D 0 




a rm QfiC/i i 

nt U / zJOufl . 1 


bDZ . . . 

1576 


coo 


UKz/ulr 


1 


n 
I 


1 4 0 . 0 J 


TT B T/""T ITT 


D J 


M 

1*1 




9 9 

z z ... 

669 


b 


UKzAz 


O 

Z 


1 


1 A Q Q "3 


1 LHVL lL-ljVnijl iv^, v Jjo 


D 0 


M 




9 9 

Z Z ... 

669 




An O 7\ — d 


Q 
0 


1 


1 4 0 . 0 J 


TrnnrTPT \/ut t/tput c 
1 t/iALJ bij vnijj. i u vijvj 


DO 


M 


rtr l \j z d z x . i 


99 

zz . . . 

669 


bjb 


UKZbn 


u 


1 


ZJD. DO 




D -7 


M 


MM DOP *7 £9 

1 


1 Q ^ £ 
1 ... jjD 


637 


0R2AnP 


0 


7 


148.83 


TLAHCAFFFFL 


57 


M 


AF102521.1 


22 . . . 
669 


638 


0R6Fn 


0 


1 


254 .2 


MFGCYGCAVPLAIAVIS 


71 


R 


M64378.1 


1 ... 933 


639 


0R2AnP 


4 


7 


148.83 


TLAHCAFLVHLISCILG 


68 


M 


AF102521.1 


22 . . . 
669 


640 


0R2Gn 


0 


1 


256.02 


LLGSCISSIHFLVSFVI 


63 


M 


M84005.1 


1 ... 936 


641 


OR7E37 
P 


5 


13 


26.5 


MAGGEFLDLHIMPAFGL 


57 


M 


^F073989.1 


547 . . . 

1515 


642 


0R5AVn 


0 


1 


256.02 


AMATVMSCMHAVFGLVI 


51 1 




^L359352.1 

\ 


9138 ... 
3177 
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SEQ 
ID # 


Symbol 


D 


c 


Mb 

coord 


CDR 


% 


S 


ACC 


Range 


643 


OR2AJn 
r 


7 


1 


254 .43 


VLLGCGINVHYVSAFLI 


55 


M 


AF102527 . 1 


22 ... 
669 


644 


OR13En 
p 

r 


1 


9 


39.89 


MLGSCLTNLQLLATLTA 


79 


M 


AJ251155.1 


15491 . . . 
16423 


fid s 

0 H D 




o 




257 .85 


FHGACAGTVGLMASFVL 


63 


M 


M84005. 1 


1 ... 936 


646 


OR2TnP 


0 


1 


254 .43 


I PGGCSLDLQAMCCMLV 


59 


M 


AF102537.1 


16 . . . 

669 


647 


OR2WnP 


2 






LMGSCVCNIMQTLGLLV 


56 


M 


M84005.1 


1 ... 936 


648 


OR13Jn 


0 


9 


39.89 


MLGSCALKTEILGSLLV 


82 


M 


AJ251155.1 


6062 . . . 
6997 


649 


OR6RnP 


2 


1 


254 .39 


SFGCFLGLPSLDSSLIS 


45 


M 


NM 010983. 
1 


178 ... 
975 


650 


0R5ATn 


0 


1 


254.39 


VLASLVYIMHGLINLDC 


50 


M 


AL359352.1 


111313 
112242 


651 


OR2Zn 


0 


19 


10. 64 


ITGVGSVNIQILSGILL 


76 


M 


AC073769.1 


54319 . . . 
55289 


652 


OR4Ln 


0 


14 


0.08 


MHGGMLI HSQLVDSLST 


53 


M 


AB030893.1 


37 . . . 

930 


653 


OR4 UnP 


1 A 


1 A 


n i r 

U . l J 


r h 9c;mamh ^olvdslsl 


46 


M 


AB030895. 1 


1 ... 924 


654 


OR4 Fn 


V 


0 


i oc. no 

10 J • -JO 


T HftftMTIHIOFVNSISA 


50 


M 


AF102522. 1 


40 . . . 
660 


655 


OR4FnP 


2 


6 


185.98 


IHGGMAIHVQFVNSISS 


50 


M 


AB030896. 1 


1 ... 906 


656 


OR4 Fn 


u 


O 




THnftMATHVOFVNSISG 

inuo I Ira 1 n v y t vnoxuvj 


50 


M 


AB030896. 1 


1 ... 906 


657 


OR 4 Fn 


0 


6 


185. 98 


I HGGMT I HVQFVNS I SG 


50 


M 


AB030896. 1 


1 ... 906 


658 


OR4AnP 


5 


11 


50.28 


IHGGI LGHVQ FVN D I C V 


65 


M 


AF102522.1 


40 . . . 
660 


659 


OR4LnP 


1 


14 


0.21 


KHGSML I HSQLVDSLST 


53 


M 


AB030893.1 


37 . . . 
930 


660 


OR7E33 
P 


6 


13 


54.79 


MAGGEFLDLRI LPAFGL 


56 


M 


AF073989. 1 


547 ... 

1515 


661 


OR2Cn 


0 


1 


257.85 


FHGACAGTVGLMAS FVL 


63 


M 


M84005.1 


1 ... 936 


662 


OR4Kn 


0 


14 


0.15 


MHGGMSVHSQFVDSLSV 


53 


M 


AF259072.1 


104176 
105099 


663 


OR5U1 


0 


6 


33.45 


VIASVAASMHILFTAAI 


84 


M 


AL359352. 1 


111313 
112242 


664 


OR4 Kn 


0 


14 


0.08 


IHGGMAVHSQFMDSLSS 


58 


M 


AF259072.1 


104176 
105099 
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WO 01/27158 
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SEQ 
ID t 


Symbol 


D 


C 


Mb 

coord 


CDR 


% 


S 


Acc 


Range 


665 


OR5V1 


0 


6 


33.45 


LWGCSANVHLLTGIGT 


84 


M 


AL365337.1 


192661 
191711 


666 


OR4QnP 


1 


14 


0.08 


LHGAMAGHVQLMNSISI 


62 


M 


AF259072.1 


104176 
105099 


667 


OR12D3 


0 


6 


33.45 


LHGSAAIYMHMLVT I SG 


70 


M 


AL359381.1 


128169 
127234 


668 


OR4Kn 


0 


14 


0.08 


IHTGMIVHSQFIDSLSS 


59 


M 


AF259072.1 


10417 6 
105099 


669 


OR51Cn 
P 


3 






MKTVSARMPMLGAM1 VV 


0 3 


p 
r\ 


arn7Qftfi4 1 


632 
1576 


670 


OR1J2 


0 


9 


105 . 94 


MITDSVLSSHLMVGVIL, 


c c. 
0 D 


M 


/it 1 U^. Zf *4 ■ J> 


52 
669 


671 


OR5BJn 
P 


3 






SIGSAAVNTKFPSCLGV 


H 0 


M 


nr u 1 j 70 j . -L 


2 ... 643 


672 


OR1J1 


0 


9 


105.82 


TIADSGICLHLIAA/UL 


Dj 


M 


an n? 4 i 

nf J. U ^ *J *1 *i • J. 


52 
669 


673 


OR13En 


0 






MLGSCLTNLQLLATLTA 




M 


ttT9Rl 1 1 
H. J Z Jll J J . J. 


1 S4 91 
16423 


674 


OR4KnP 


5 


14 


0.08 


IHGGMVIHTHFVNSLSM 




M 




j / ... 

930 


675 


ORlLnP 


5 


9 


105.84 


MYGNSFFHLHLQEAVLT 


54 


M 


AC023167.7 


60743 . . . 
61663 


676 


OR2CnP 


2 






FHGACAGTVGLMASFVL 


59 


M 


M84005. 1 


1 ... 936 


677 


OR4TnP 


9 


14 


0.21 


MLSELLSHSQFVKbLbl 


4 1 

H 1 




arm q?7? 4 


62255 . . . 
61317 


678 


OR5BnP 


1 






FVITSGCNIHNI VVNDF 


51 


M 


AF121977.1 


262 ... 
1197 


679 


OR4Kn 


0 


14 


0.21 


I HGGMTLHFQFINS I SS 


53 


M 


AB030896. 1 


1 ... 906 


680 


ORllLn 


0 


1 


254 . 4 3 


LVGACVTTLnMlLb VLi 


so 


M 


AF121972 1 


171 . . . 
1109 


681 


OR7E68 
P 


5 


10 


17 . 21 


i/APPIPT T rsT UTMDTlFf'T 

MAGGLLLULin lMrftt OLi 


S 6 
J o 


M 


AF102536 1 


22 ... 

669 


682 


OR7EnP 


2 


10 


17.21 




R4 


M 


AF073989 . 1 


547 ... 
1515 


UU J 


OR7E31 

P 


6 


9 


70.71 


TAGGELLDLHIMPAFGL 


55 


M 


AF073989. 1 


547 ... 
1515 


684 


OR7EnP 


3 


9 


70.71 


M VAC DVLDLHIMDSFGL 


58 


M 


AF073989.1 


547 ... 
1515 
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SEQ 
ID # 


Symbol 


D 


c 


Mb 

coord 


CDR 




c 
o 




Range 


685 


OR5AKn 
P 


3 


11 


52 . 82 


LAATLbMNVHr Lit VN.Lt 


1 Q 


D 

i\ 


050948 1 

\J *J \J ~ \J . J- 


34 ... 

978 


686 


OR5AKn 


0 


11 


52 . 83 


FAATCGMNVyb Lr VNijt 


1 Q 


D 
t\ 


vj j \j y *t v . 


34 ... 
978 


687 


OR5AKn 


0 


11 


52 .83 


FAATCGINVHF Dr VDLt 


1 Q 


D 
I\ 




34 
978 


688 


OR5BQn 
P 


9 


11 


52.82 


TTTTTLLLLLMLTFFFF 


42 


R 


U50948.1 


34 . . . 
978 


689 


ORINn 


0 


9 


105.94 


LLGGNVLPMHLIMGFLV 


56 


R 


AF091566.1 


1 ... 663 


690 


OR1J4 


0 


9 


105.94 


MITDNVLNSHLIVGVIL 


69 


M 


AF102524.1 


52 . . . 
669 


691 


ORINn 


0 


9 


105.94 


MLGDSLLVTHLVLGVLV 


85 


R 


AB038167.1 


1 ... 933 


692 


OR2AnP 


4 


3 


94.41 


TLAVCTIMVHHLGSIVG 


65 


M 


AF102521.1 


22 . . . 

DO" 


693 


OR2ANn 
P 


17 


9 


93.78 


.... . VWLEFMVNLLI 


23 


M 


AC074177.4 


128803 
129726 


694 


OR5K1 


0 


3 


104.47 


FCETCGAHIHLLFSVQF 


51 


R 


7\ nrt Qi ETC 1 

At uyio f o • i 


D Z ... 

663 


695 


OR2K2 


0 


9 


93.78 


MLGSCVTTLEFMVSLLI 


60 


M 


TV TO C T 1 C A 1 

AJ2d11c>4 . 1 


36615 


696 


OR8Hn 


o 


11 


51.76 


MAGTCGIDVNSI IVTLV 


51 


M 


AC069559.8 


36251 ... 
35322 


697 


ORnP 


15 


11 


51.76 


LIFKNLFSPPLXXHYIL 


28 


M 


X89682.1 


2 ... 472 


698 


OR4AnP 


14 


11 


50.28 


FGRRWGH I QLYGHNYV 


38 


M 


AB030895.1 


1 ... 924 


699 


OR4An 


0 


11 


50.28 


LHGGWGQFQI VNGSC I 


59 


M 


AB030895.1 


1 ... 924 


700 


OR6Sn 


0 


14 


0.58 


FFGAFAGPGPADLAVIS 


50 


R 


M64378.1 


1 ... 933 


701 


OR4RnP 


16 


11 


50.28 


NLGAIMEHVXSVNGNYL 


52 


M 


At XUZO^Z . 1 


a n 

H u ... 
660 


702 


OR13Cn 


0 


9 


86.77 


MLGTCG I NVQ FLTT FLT 


65 


M 


AJ133425.1 


61 . . . 
i m a 


703 


OR13Dn 
P 


4 


9 


86.77 


MYGSCVLNTELIGNFLS 


o4 


M 


ALU/ J / O J . D 


■J l X c. V *a 

372220 


704 


OR7EnP 


3 


11 


2.13 


MIACGVLDLHIINSFGL 


54 


R 


AF091580. 1 


7 ... 663 


705 


ORlOPn 
P 


1 


12 


59.88 


MIGICTTTTHLVATFI I 


49 


M 


AF247657. 1 


1 ... 945 


706 


OR8In 


0 


11 


51.76 


MWCCMISISVSLATLS 


50 


M 


AC069559.8 


137090 
138039 


707 


OR8G1 


0 






. .IIIGICVHCIVGNIV 


75 


R 


AF091576. 1 


52 . . . 
663 
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WO 01/27158 
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SEQ 
ID # 


Symbol 


D 


C 


Mb 

coord 


CDR 


% 


S 


Acc 


Range 


708 


ORnP 


/ 


1 o 
1 c. 


j y . o o 


rrpf:FAFFTT.T, 


34 


M 


AL359352 . 1 


145887 
14 5042 


709 


OR5F1 


0 


11 


51.76 


MIATCGANVNHSLANIG 


50 


M 


Y15525.1 


1 ... 705 


710 


OR5FnP 


1 


11 


51 .76 


MIATCGANVNYFFANKG 


52 


M 


Y15525. 1 


1 ... 705 


711 


OR6BnP 


6 


2 


251.7 


LSVCCFSIIKFDLAILF 


70 


M 


L14567.1 


17 . . . 

667 


712 


OR2D1 


0 






t t rrrn c:\r\7nirTTfZT T T 

LLoLtnJ V VUt 1 I'OxJjX 


64 


M 


AF073987 . 1 


2 ... 649 


713 


OR5ASn 


0 


11 




PariAL/djO 1 v nxjjjxj\^x 


52 


M 


AC068904 . 1 
5 


165039 
165965 


714 


OR5SnP 


8 


2 


251.7 


FSSTTGRSVQLKLCMMN 


64 


R 


AF091579.1 


7 , . . 663 


715 


OR5AQn 
P 


0 


11 


51.76 


SAVTDAGN1 HCsrt ol/ir 


O X 


p 


AO U U / 1 . 1 


203 ... 
1129 


716 


OR6BnP 


3 


2 


251 . 7 


LSVCCFbl IKr U-L/il Lit 


67 
\J 1 


M 

1 j 


T.14S67 1 


17 ... 
667 


717 


ORSJnP 


2 


11 


51 .76 


YVLTGbGN I nblit b 


3 £ 


p 


XR0671 1 


203 ... 
1129 


718 


OR 9 An P 


4 


7 


146 . 91 


QLGTLVFFWr/UjMAx 1L> 


A A 


M 


MM 010991 

1 


1 ... 939 


719 


ORSBEn 
P 


2 


11 


51.76 


if r>T fnpuT XTT LI C TT CTCT 

YSLTCVLNInor Lol o 1 


A ^ 


R 




7 ... 663 


720 


OR9An 


0 


7 


14 6.91 


LLGTFVFFWr VJjMHV IiU 


A 7 


M 


1 


1 ... 939 


721 


OR8Hn 


0 


11 


51.76 


MVGTCGIDVNbx 1A1 LiV 


D x 


M 




36251 . . . 
35322 


722 


OR5BNn 
P 


14 


11 


51.76 




54 


M 


AF102528.1 


52 . . . 
669 


723 


OR8Jn 


0 


11 


51.76 


LLIVVLYTWCVSANLF 


80 


M 


X89682.1 


2 ... 472 


724 


OR9NnP 


9 


7 


146.91 


LFGTFIIIIIL. AAAAA 


36 


M 


NM 010991. 
1 


1 ... 939 


725 


OR7EnP 


4 


7 




MVACGMLDLHITHSFAL 


51 


R 


AF091580. 1 


7 ... 663 


726 


OR7E9P 


3 


7 






.J X 


M 


AF073989. 1 


547 . . . 
1515 


727 


OR8KnP 


8 


11 


51 . /b 


mmttt TfT'.TTnTT TMT.P 
MMl I LlL^x xUxxjllNxjr 


36 


M 


AC069563. 9 


28460 ... 
29383 


728 


OR2AnP 


1 


/ 


1 A Q Q "7 


t t aur 


44 


M 


AF102521 . 1 


22 . . . 
669 


729 


OR8Kn 


0 


11 


51.76 


LLII FIYQMFKSFSNLS 


56 


M 


AF102528. 1 


52 . . . 

669 


730 


OR7E39 
P 


4 






MVGGELFHLHIMPAFGL 


55 


R 


AF091580. 1 


7 ... 663 
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SEQ 
ID # 


Symbol 


D 


C 1 


coord 


ZDR 




Q 
O 




Range 


731 ( 


DR7E27 
P 


3 






MAGGELLDLHIMPAFGL 


O ' 


M 


rit iUt j j v • x 


22 ... 
669 


732 


DR2Hn 


0 


6 




FLGTCVMEVQSLASILV 


0 1 


M 


at HI P. 6^0 1 
rlLiU / O D j U . X 


41097 
40165 


733 


OR13Cn 
P 


2 


9 


40.16 


MLGACG AT VQLMAN FLV 


o / 




rtU x ~> -J ** O . x 


61 

1017 


734 


OR13Cn 


0 


9 


40. 16 


MFGACGAAVQLMTN FLV 


o y 


It 4 

M 


A J 1 J J fi ^- ^ • x 


ox ... 
1017 


735 


OR2S1P 


4 


9 


40.16 


MFGACGANVQLMTNFLL 


8 9 


M 


AJ^l^l 134.1 


C, I \J -J ... 

1747 


736 


OR2AMn 
p 


1 


9 


40. 16 


RRRRRV . MMMMM 


63 


M 


AJ251154.1 


2703 ... 
1747 


737 


OR1N1 


0 


1 




MLGDSLLVTHLVLGVLV 


85 


R 


AB038167.1 


1 ... 933 


738 


OR2S2 


0 


9 


40. 13 


MFAGCSIAVHLMTNFLV 


83 


M 


AJ2bl lb4 . 1 


Z / U J ... 

1747 


739 


OR7E26 
p 

rr 


4 


1 




MAGGELLDLHIMPAFGL 


56 


M 


AF102536.1 


22 . . . 
669 


740 


OR1F11 


0 






LAGNNGVNLHLI EGVMT 


99 


R 


M64377. 1 


1 ... 939 


741 


OR5ACn 
P 


3 


3 


103. 97 


FGATCIIHIHLIFSIQF 


66 


R 


AF091575 . 1 


DZ . . . 

663 


742 


OR5B10 
p 

rr 


2 


13 




MVATNGCNLRDLMSNVL 


4 6 


M 


AF o . 1 


^9 

669 




OR 9 Z\rk P 


1 


12 


85.7 


TLAVCAFLVHLIACILG 


76 


M 


AF102521.1 


22 . . . 
669 


744 


OR1E5 


0 


13 




MLGDSLLHLHLIMGILI 


83 


R 


Y07557.1 


1 ... 942 


1 A ^ 


OR 4 Pn 


o 


6 


185.71 


IHGGMVLHFQFVNSICG 


51 


M 


AB030896.1 


1 ... 906 


746 


OR5CnP 


0 


9 


40.53 




47 


M 


Y15525.1 


1 ... 705 


747 


OR2WnP 


0 


6 


31.62 


LLGGCVSNIMQALAIIA 


64 


M 


AF102o 1 b . 1 


R9 

669 


748 


OR2L2 


0 






. . II IGINAHYVSSFLL 


48 


M 


AF102537.1. 


16 . . . 


749 


OR4H8P 


2 


14 




MHGCILGHVQLVNSISG 


56 


M 


AF259072.1 


104176 
105099 


750 


OR5D10 
P 


5 






LCWTTWCTLFTSANES 


4 4 


R 


At Vive. I? J . 1 


91 1 
1143 


751 


OR7A12 
P 


1 


14 




MV I VSAMN I EMMS ALGG 


C Q 
DO 


M 


nroo Qt:co 1 
At j jjO . x 


1 ... 927 


752 




r 
I 






II IGINAHYVSTFLF 


48 


M 


AF102527.1 


22 . . . 
669 


753 


OR2F3P 


c 


) 14 




LLGGFTSSVQIISSLLT 


55 


M 


AF073974 . 1 


41 . . . 
649 
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SEQ 
ID # 


Symbol 


D 


C 


Mb 

coord 


CDR 


% 


S 


Acc 


Range 


754 


OR4H10 
P 


2 


15 




MHGC I LGHVQLVNS 1 SG 


57 


M 


AF259072. 1 


104176 
105099 


755 


OR5H1 


0 






t t tt cut hfvfstof 


56 


R 


AF091575. 1 


52 . . . 
663 


756 


OR2K1 


0 






TT TTTTT VrMV^TJ.T 

. , J.1 ill iilV V-»l J VOJjiJl 


58 


M 


AJ133428 . 1 


61 . . . 
1017 


757 


OR7E11 
P 


7 


11 




MAGGEFLDLHILPAFGL 


52 


M 


AF073989.1 


547 ... 

1515 


758 


OR7A3P 


1 


11 




MVIVSAMNIEMMSALGG 


68 


M 


AF283558. 1 


1 ... 927 


759 


OR6A1 


0 


11 




LLGCCGGIVKLDLAILG 


91 


R 


M64386.1 


130 ... 
975 


760 


OR5I1 


0 


11 




FCADSLGSVH FLYGVE I 


52 


M 


Y15525. 1 


1 ... 705 


761 


OR2H3 


0 


6 




-r t rmr>UTrunCW7\CTT \J 

ILGTCVlCjVUbVAolij V 


Q C 

0 D 


M 


r\Li U r O D J v • 1 


41097 . . . 
40165 


762 


OR10J1 


0 






MVGICGIVTQSTI0VL.V 


1 1 
f J 


M 


A 27 £. J D J . X 


8035 . . . 
8961 


763 


OR7E3P 


3 


11 




MFACGVLDLH 1 1 DS FGL 


54 


M 


AF102536. 1 


22 . . . 
669 


764 


OR1D6P 


1 


11 




LVVANLFYIHLLTGIFI 


48 


R 


Y07557. 1 


1 ... 942 


765 


OR5D10 
P 


2 


18 




LCVVTTWCx lit 1 bAobb 


A C\ 


D 
i\ 


. X 


34 ... 

978 


766 


OR5D5P 


2 


18 




LCWTTWCTLr 1 bANho 


A £ 






29192 . . . 
30115 


767 


OR52A1 


0 


11 




MHQGSMAVCL I G VAVAF 


72 


M 


NM_013620. 
1 


1 ... 945 


768 


OR2AEn 


0 


7 


98.36 


HLGGCMGNIHIVSSLLL 


48 


M 


AC073769. 1 


143294 
142353 


769 


OR6LnP 


7 


10 


149. 44 


LLSSCSSAVbLiKAAlliA 


a n 

H U 


M 

rj 


MM f)1 0 Q R ^ 
1 


178 . . . 
97 5 


770 


OR6LnP 


7 


10 


14 9.44 


L L S O C O b A V b Li KHA 1 Ll/A 


H X 


M 


mm D 10983 

1 


178 . . . 
975 


771 


OR7MnP 


7 


10 


14 9.44 


M\fV\/Q T 




M 


AC073947 3 


43325 . . . 
42733 


772 


ORl3Cn 


0 


9 


8 6.7/ 


Mr LjAL^Ij 1 UVL^rrioiNVijX 




M 


AJ133428 1 


61 ... 
1017 


773 


OR13Cn 


0 


9 


86.85 


MLGTCGAN VQ FMAT FTM 


71 


M 


AJ133425.1 


61 . . . 
1014 


774 


OR2InP 


6 








79 


M 


AL078630. 1 


151152 
150391 
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WO 01/27158 
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SEQ 
ID # 


Symbol 


D 


C 


Mb 

coord 


CDR 


% 


S 


Acc 


Range 


775 


OR 4 An 


0 


11 


50.28 


LHGGVVGHFQWNSICV 


58 


M 


AB030895.1 


1 ... 924 


77 6 


OR2InP 


3 






RRRRRMARILL 


77 


M 


AL078630.1 


151152 

± ~> u _? y x 


777 


OR4AnP 


4 


11 


50.28 


LHGG VVGS FQWNG I CV 


53 


M 


AB030896. 1 


1 ... 906 


778 


OR4AnP 


7 


11 


50.28 


PHGG AVAH FQWNG I C V 


57 


M 


AB030896. 1 


1 ... 906 


779 


OR8C1P 


2 


11 




LCVHCGMGVHCMIWVV 


72 


M 


AC068905.1 
2 


76922 ... 
75948 


780 


OR4AnP 


1 


11 


50.28 


LHGD WGH FQWNG I CV 


56 


M 


AB030896.1 


1 ... 906 


781 


OR7E15 
P 


5 


11 




MAGGELQDVHIMPAFGL 


54 


M 


AF073989.1 


547 ... 
1515 


782 


OR10A1 


0 


11 




MFGVCAPWQWAGTVVI 


76 


M 


AF247657.1 


1 ... 945 


7 R ^ 

f O J 


OR? An 


o 






TSAVCTCLVHLI 


70 


M 


AF102521.1 


22 . . . 
669 


784 


OR7EnP 


6 






MAGGELFHLHIMPAFGL 


57 


M 


AF073989.1 


547 ... 
1515 


785 


OR7En 


0 






MAGGDFLDLHIVPAFVL 


54 


R 


AF091580.1 


7 ... 663 


/ O 0 


VJtw J./\ J. 
P 


-j 


1 1 

X -L 




MHTLSARLPLLAVITFL 


43 


R 


AF079864 .1 


632 ... 
1576 


7R7 
(Of 


DR7F4 7 
P 


4 






KAGTNLLDLYIMPTFGL 


56 


M 


AF073989. 1 


547 . . . 
1515 


788 


OR5B5P 


2 


3 




MAATNICNIHELVANIS 


48 


M 


AF146372.1 


509 ... 
1456 


789 


OR1F10 


0 


3 




MFVDNGVNLHLIEGVMT 


72 


R 


M64377.1 


1 ... 939 


790 


OR8G2 


0 






. . IIIGLGIHFVLSNIT 


75 


M 


AF102518.1 


52 . . . 
669 


7 Q 1 


UiMOll 


o 


11 


54 .08 


MIWNILITHLLVGVIF 


55 


M 


AC073769.1 


133488 

IJi JJU 


792 


OR4AnP 


3 


11 


50.73 


LHGGAVGHFQWSGLCV 


56 


M 


ABO JOo y b . 1 


i ... yuo 


793 


OR4AnP 


7 


11 


50.76 


LHGGI LGH FQWNGMCV 


58 


M 


ABO JUo yb . 1 


i on £ 


794 


OR4AnP 


5 


11 


50.66 


LHGGVLGH FQWNGMRV 


5b 


M 


AbUJUo y b . i 


1 ... jUD 


795 


OR4AnP 


7 


11 


50.73 


PHGGWGRFQWKVICV 


54 


M 


AB030896. 1 


1 ... 906 


796 


0R4AnP 


1 


11 


50.81 


LHGGI VGHFQWSGMCV 


60 


M 


AB030896. 1 


1 ... 906 


797 


OR4AnP 


10 


11 


50.81 


LHGGWGN FQWNG I CV 


55 


M 


AF102522. 1 


40 . . . 

D DU 


798 


0R4An 


0 


11 


50.73 


LHAGVAGH VQFMNG ICV 


62 


M 


AB030895.1 


1 ... 924 


799 


OR 4 An 


0 


11 


50.73 


LHGGWGHVQFVNGICV 


57 


M 


AB030896. 1 


1 ... 906 


800 


OR7E42 
P 


4 






MAGGELQDVHIMPAFGL 


54 


M 


AF073989. 1 


547 ... 
1515 
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SEQ 
ID # 


Symbol 


D 


c ^ 

c 


lb c 
;oord 


:dr 


2. < 
5 


2 J 




Range 


801 C 


)R2M3P 


2 






[TLGCFLDI DALCCMI F 


55 1 


* i 


&F102537.1 


16 . . . 
669 


802 C 
1 1 


)R4H11 


2 


4 


I 


vlHGCILGHVQLVNSISG 


57 


M 


AF259072.1 


104176 
105099 


803 ( 


}R7E57 

P 


5 






MAXGEFLDLHILPAFGL 


D 1 


M 




22 ... 
669 


804 < 


DR2B1P 


0 


5 




LLGAY ATNWLLLVS FH I 


78 


R 


t ^ a m a 1 


73 ... 
1011 


805 


OR7E34 
P 


2 






MAGGDSLDLHIMPAFGL 


56 


M 


At u / j y o y • J- 


547 ... 
1515 


806 


OR7E56 
P 


4 






MAGDELFFLHILPAFGL 


52 


M 


AF073989.1 


547 ... 
1515 


807 


OR3AnP 


1 


5 




L HAG C AC N T H ALAAMAA 


49 


M 


AFU / t> / . 1 


9 64 9 


808 


OR4H5P 


2 


5 




MHGCI LGH VQLVNS I SG 


56 


M 


AFZo yu /z . i 


105099 


809 


ORlEn 


0 


5 




MLGDSLLHLHLIMGILI 


82 


R 


Y07557.1 


1 ... 942 


810 


OR51Cn 
P 


2 


11 


3 




48 


M 


AF12 iy / b . 1 


JU ... 

1012 


811 


OR2WnP 


2 


6 


30.51 


LLGGCVSNIMQALAIIA 


64 


M 


AFlUZOlo . 1 


^9 
669 


812 


OR51B1 
P 


5 


11 




AHSVSGRSPVRPLITIL 


68 


M 


a irn T 1 fi P H 9 
At U / 1 U oU . £ 


-L -J -/ -L ... 

16851 


813 


OR7E81 
P 


3 






MAGGEFFSLHIMPAFGL 


54 


M 


AFlUz o Jo . J- 


99 1 

669 


814 


OR7E44 
P 


1 






MAGGELFDLHIMLAFGL 


53 


M 


AFU / J y o y . 1 


7 

1515 


815 


OR5B7P 


2 


6 




MAATN I CN I H EL VAN I S 


47 


M 


1 


1 ... 948 


816 


OR7E36 
P 


4 






MAGGELFFLHIMPAFGL 


58 


M 


At u / o y o y . 1 


547 
1515 


817 


OR2A5 


\ C 


) 1 




TMAHCTCLVHLIASILG 


74 


M 


nrl fi9 R 9 1 1 
At lUzO/iJ. . J- 


22 ... 
669 


818 


OR5B1P 




> e 


i 


MAATNICNIHELVANIS 


47 


M 


AF146372.1 


509 ... 
1456 


819 


OR8B8 


( 


) ii 


L 137. 6i 


I LLWSGMGAHCWVDIV 


72 


M 


AC069559.8 


120212 
119283 


820 


OR8B4P 


( 


d i: 


L 137.7: 


L LCVNCGVGAHSFWITL 


8-? 


M 


AC068910.2 
1 


133103 
132162 \ 
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SEQ 
ID # 


Symbol 


D 


C I 

< 


zoord 




4 

o 


s 


Acc 


Range 


821 < 


DRnP 


15 


11 


137.77 


LC VEN RRT AT HCKS H 1 1 


35 


M 


AC069563.9 


60295 . . . 
59327 


822 ( 


DR8B3 


0 


11 


137.77 


LLV I CAMGAH C VWN I V 


85 


M 


AC069563.9 


129775 
130725 


823 


OR2Bn 


0 


6 


30 . 51 


t t rcfACMT OW7 T Q FT T 


89 


R 


L34074 . 1 


73 . . . 
1011 


824 


OR8B6P 


6 


11 


137 .77 


LAFFCGLSAHL VAAAV i 


7 ? 


M 


AC069559. 8 


96224 ... 
95292 


825 


OR8B5P 


6 


11 


137.77 


LFFFXGLGAHCWANTV 


73 


M 


AC069559.8 


96224 ... 
95292 


826 


OR4E2 


0 


14 


1.7 


LHACIAGHGQLINSISS 


90 


M 


AF259072.1 


104176 
105099 


827 


OR8B7P 


4 


11 


137 .77 


FCVI CGWGAHCVAAI FV 


/ 1 


M 




96224 . . . 
95292 


828 


ORllJn 
P 


3 


15 


1 . 82 


FSCAGFGSMPLCVS 111 




M 

re 


AF121972 1 


171 . . . 

1109 


829 


OR4E1P 


3 


14 


1 . 7 


MHACIAGHALLINSISV 






t\JD\J -J v O J ~J • J- 


37 ... 
930 


830 


ORlODn 
P 


7 


11 


137 . 96 


HHHILLGNVLSI 


o D 


M 


nr A74 177 4 


12106 . . . 
13038 


831 


ORnP 


10 


14 


1.7 






M 

re 


/Tit XU^JJf • J. 


22 ... 
669 


832 


OR8D2 


0 


11 


137.77 


LLVIGVLWVHRLIGNTA 


70 


M 


AC073947.3 


29192 ... 
30115 , 


833 


ORllIn 
P 


1 


1 


126.31 


FGAACGCL I T LAT SVTI 


51 


M 


AL359381.1 


175785 
176720 


834 


OR11 Jn 
P 


1 


15 


1 . 82 


FSCACFGWTPLCI S I I L 




M 


AF121972 1 


171 ... 
1109 


835 


ORlOAn 
P 


3 


11 


5 . 64 


MFGVCTPVVQWAb I V V 1 


7 4 


M 


AF24 7657 . 1 


1 . . . 945 


836 


OR8C3P 


5 


11 


137.77 


LCVHCGMGVHCMIVWV 


73 


M 


AC068905.1 
2 


76922 ... 
75948 


837 


OR2DnP 


€ 


11 


5. 64 


LLGCCGSVVDFITGILI 


62 


M 


AF073987.1 


2 ... 649 


838 


OR4PnP 


C 


11 


51.03 


LHGGIVGHSQL 


59 


M 


AB030895.1 


1 ... 924 


839 


OR7E21 
P 








MAGGEr 1 UiihlMrm bit 


50 


M 


AF073989. 1 


547 ... 
1515 


r 4 n 

O H \J 


OR2M1 


c 






IVLGCFLDIYAICSMLF 


55 


M 


AF102537.1 


16 . . . 

669 


841 


OR7AnP 


4 


IS 




NLAGVVMNLQM 


63 


M 


AF073970.1 


41 . . . 

649 
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SEQ 
ID # 


Symbol 


D 


C 


Mb 

coord 


CDR 


% 


S 


Acc 


Range 


842 


0R5D11 
P 


1 


8 




LCVVTTWCTLFTSANES 


44 


R 


AFU 1 UZ . 1 


Oil 

z n ... 
1143 


843 


OR7E50 
P 


7 


8 




I WCDMLDLHVFLDI FL 


57 


M 


AF1 Uzbob . 1 


ZZ . . . 
669 


844 


OR7E45 
P 


3 






MAGGELFDLHIMPAFGL 


54 


M 


AF 0 / 39by . 1 


1515 


845 


OKI Ell 
P 


6 






MAGGEFLDLHIMPAFGL 


51 


M 


AF073989.1 


547 ... 

1 J 1 0 


846 


OR8B2 


0 


11 


137.77 


LL V I CAMGAHCVWN I V 


84 


M 


AC069563.9 


129775 
130725 


847 


OR8D1 


0 


11 


137.77 


LVWGALSTHALIANTV 


87 


M 


AC073947 . 3 


29192 . . . 
30115 


848 


OR8B1P 


4 


11 


137.77 


LLLVCGMGAHCVWNIV 


84 


M 


AC069559.8 


96224 ... 
95292 


849 


OR7A1P 


2 


19 




MI WSVVYLQMMTSLGG 


72 


R 


M64376.1 


1 ... 999 


850 


OR7E8P 


4 


8 


13.72 


MVACGVLDLH 1 1 DS FGL 


53 


M 


AF102536. 1 


22 . . . 
669 


851 


OR4DnP 


7 


11 


55.86 


MHGGVAGHVQLMNNISL 


58 


M 


AC019272.4 


183633 
182701 


852 


OR7E80 
P 


7 


8 


13.72 


MAGGELQDVH IMPAFGL 


54 


M 


AF073989.1 


547 ... 

1515 


853 


OR4DnP 


5 


11 


55.86 


MHGGAAGHVQLMNNLTL 


62 


M 


AC019272.4 


183633 
182701 


854 


OR7E10 
P 


8 


8 


13.72 


I VAC DLL DLH 1 1 DS FGL 


55 


M 


AF073989. 1 


547 . . . 
1515 


855 


OR10B1 
P 


3 


19 


17.91 


MLGCCLSVIEMILSVVM 


85 


M 


AC012302.5 


54283 . . . 
55224 


856 


OR2InP 


3 






LLLLMARILL 


75 


M 


AL078630.1 


151152 
150391 


857 


OR4Dn 


0 


11 


■ 55.86 


MHGGVGGHAQLMNNVSF 


65 


M 


AC019272.4 


183633 
182701 


858 


OR5ACn 


0 






.VWVIIHVHLIFGIQP 


65 


R 


AF091575.1 


52 . . . 

bbo 


859 


OR2I1 


0 


6 


33.63 


LLGSCASNAQLMARILL 


79 


M 


AL078630. 1 


151152 
150391 


860 


OR10H1 


0 


19 


19.86 


MFGFSCGMVVAGLVTAL 


88 


M 


AC023604.2 


245345 
246298 
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SEQ 
ID # 


Symbol 


D 


C I 

< 


4b 

-oord 




% 


s 


Acc 


Range 


861 ( 


3R7E59 
p 


5 






pnr7\D\;rT t UTMDaTTflT 


S3 


M 


AF102536. 1 


22 . . . 
669 


862 ( 


DR7E28 
P 


4 






w t\ /-'<'-' ITT T HT UTMDaFfZT. 


54 


M 


AF073989. 1 


547 ... 
1515 


863 


DR5B3 


0 






uunTMPHMT U HT UVKT T 
MVAI NbLINlnULi v vv*i l 


51 




U50948 . 1 


34 ... 

978 


864 


0R2A6 


0 






TLAHCAr LVrijlAUiijo 


75 


M 


AF102521 . 1 


22 . . . 
669 


865 


0R6Cn 


0 






. WWCAI PPLVMAALl 


A 1 


M 


mm 010 991. 

1 


1 . . . 939 


866 


OR7E54 
P 


5 






MAGGEFLDLHIMPAr (jL. 




M 


AF073989 1 


547 . . . 
1515 


867 


OR7E48 
P 


3 






fc ^ _ — « t r*\ t tt Tiki T*\ T\ TT^/™* T 

MAGGEFLDLHIMPAFGL 


D f 


D 

f\ 




7 ... 663 


868 


OR67An 
P 


3 


11 


76. 42 


MHSCAGTLPAQGIAVSL 


83 


R 


AF091561.1 


52 . . . 
663 


869 


OR4 DnP 


1 


11 


55.86 


MHGGVAGHVQLMNNLTL 




M 


arm <3?72 4 


183633 
182701 


870 


OR4CnP 


1 


11 


50. 91 


VHGC I LG HAQLLNS ICS 




M 




1 ... 906 


871 


OR4DnP 


2 


11 


55.86 


IHGGIAGHVQLMNNVTL 


65 


M 


AC019272.4 


183633 
182701 


872 


OR10H2 


0 


19 


19.94 


MFGFSCGMVVAGLVMAL 


85 


M 


AC023604.2 


245345 
246298 


873 


OR10H3 


0 


19 


19.94 


MFGFSWGMMVMGLVTAI 


75 


M 


AC023604 .2 


214343 
213396 


874 


OR55Cn 
P 


2 


11 


2 . 65 




H D 


M 




160 . . . 
1053 


875 


OR55Bn 
P 


3 


11 


2.65 


. WWLQVPLLGMCTVb 


JJ 


M 




160 . . . 
1053 


876 


OR52Vn 
P 


4 


11 


4 . 19 


LHNHIMVYXFLGTTbfLi 


A ft 


M 

L J 


NM 013619. 

1 


118 . . . 

969 


877 


OR2B3 


C 


) 6 


33 . 64 


t t r> t\ r^ 1 t*t \"\T r>T T rCTT T 

LLGACFINLyLiit blJjl 




D 

r\ 


L34074 . 1 


73 . . . 
1011 


878 


OR52Tn 
P 


i 


i 11 


4 . 22 


r->/*»tJTrT TPT nCT HTT TTQ 


45 


M 


AF121975. 1 


50 . . . 
1012 


R7 9 


OR2 J1P 


c 


) 6 


» 33.64 


LLGTCASTLHFLMSFVI 


57 


R 


L34074.1 


73 . . . 
1011 


880 


OR52Hn 
P 




I 11 


4. IS 


) LHFVSGRVPCLGVPTVT 


60 


M 


AF121975.1 


50 . . . 
1012 
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ID # 


O yTuiJUl 


n 
\j 


c 


Mb 

coord 


CDR 


% 


S 


Acc 


Range 


R R 1 
0 0 1 




n 

V 




33 , 64 


LLGTCASNLHFLTSFVI 


58 


R 


L34074 .1 


73 . . . 
1011 


Q Q 1 

ooZ 




n 
u 






FHSVS . ... WRLFS 


75 


R 


AF079864 .1 


632 ... 
1576 


R R "5 


OR A On 


n 

u 






. VVVVAGHMQLVNSLSV 


56 


M 


AB030893.1 


37 . . . 

930 


Q Q A 
0 0 H 


p 


9 
c. 


1 1 


4 ? ? 


I.HFVSVRTSILGVPSVL 


60 


M 


AF121975.1 


50 . . . 
1012 


q O r 

00 D 




Q 

y 


D 


^ 64 

J J . OS 


T UC.CC PI YSEALVCMLV 


81 


M 


AJ132195.1 


79 - . . 
906 


886 


ORblEn 
P 


1 






FHctaqVR FPT.T.nAT AMV 


90 


R 


AF079864 . 1 


632 . . . 
1576 


887 


OR2J2 


0 


6 


33.64 


LLGICAIILHFLMSFVI 


57 


R 


L34074 .1 


73 . . . 
1011 


888 


0R2In 


0 






RRRRRRMARILR 


77 


M 


AL078630.1 


151152 
150391 


889 


0R2 J4 P 


c 
0 


o 


33 . Ofl 


Jj JjO 1 k^nO liL/flL J-i 1 *J L V J_i 


56 


R 


L34074 . 1 


73 . . . 
1011 


890 


OR7E4 0 
P 


4 






MnrrnTT ni ytt PDFflT, 


55 


M 


AF073989. 1 


547 . . . 
1515 


891 


/*\ r-> O LI X 

OR/:H4 r 




b 


"3 ^ £4 
JJ - 0 4 


J_j J_»0/-i J. Jj 1 ylyfti JLTV ij JJ JJl J 


63 


M 


AL078630 . 1 


41097 . . . 
40165 


892 


OR7E52 
P 


5 






I WCDVLDLHVCDI FGL 


61 


M 


AF073989.1 


547 ... 
1515 


893 


0R2InP 


9 








80 


M 


AL078630.1 


151152 
150391 


8 94 


OKoLl 


U 






T TfZVFTVT PAT.GCATLF 


52 


M 


NM 010991. 
1 


1 ... 939 


o 95 


/^n inn 
OR / b jU 

P 


*3 
O 








56 


M 


AF073989.1 


547 . . . 
1515 


one 

8 96 


p 


n 
U 




jj . oy 


T WT^VFNTONTjFSVTL 

Jj V V 1 u V t lilyliUC w » i jj 


51 


R 


AF091579 . 1 


7 ... 663 


o 9 / 


UK / HI r 




1 Q 

i y 


11 . JO 


MMf^TVT.YTOLLVALDV 

I Jl j O O J. V Jj X J. <^u jj v njju v 


74 


M 


AF073989 . 1 


547 . . . 
1515 


o yo 






1 1 

1 X 


^4 4 S 


MVATNGCNFHGLTSNI F 


47 


R 


U50948.1 


34 ... 

978 


899 


0R5AZn 
P 


1 


11 


53.69 


MIGTCTVNLLCILCLIF 


48 


R 


AF091579.1 


7 ... 663 


900 


0R5Bn 


0 


11 


54 .45 


MVATNGCNIHDLWNI I 


51 


R 


U50948.1 


34 . . . 
978 



135 
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SEQ . 
ID # 


Symbol 


D 


c In 

Ic 


lb C 
roord 


:dr 


b * 


3 i 


\cc 


Range 


901 C 


)R52Bn 


0 


11 


4.221 


t rcj\OT DCT PABCTT 


64 I 




NM 013619. 
1 


118 ... 
969 


902 C 


)R5BnP 


2 


11 


54 . 45 1 


inumMTrviTUrT 1 7 7i K! T C 
^lAATNICN IHLbVAlNl O 


4 Q 
*i 


R 


U50948 . 1 


34 . . . 
978 


903 C 


)R52Dn 


0 


11 


4 .19 


MHYASVRI PFLGVAAML 


66 


M 


AF121976.2 


474 ... 

1307 


904 ( 


DR7A11 


1 


19 


17 .72 


MVEASAI DLHMMAVLGV 


67 


M 


AF283558.1 


1 ... 927 


905 ( 


DR5BnP 


9 


11 


54 .45 


MAATSALTVDDLLQFt L 


A 1 


M 

Li 


mm 013728. 

1 


1 ... 948 


906 


0R51An 
p 


5 


11 


4.19 


THSWFSRMPLLGIVAFV 


OV 


p 


AF0798 64 . 1 


632 . . . 
1576 


907 


t 

0R7A15 

n 


4 


19 


17.72 


MIVGSVTHLHMMAALGG 


74 


R 


M64376. 1 


1 ... 999 


908 


r 

OR7C2 


0 


19 


17.72 


I IGCNGIGLETMVTLGF 


98 


R 


AF091580.1 


7 ... 663 


909 


OR7E23 
P 


7 


21 


20.89 


MAGGELFHLQIMPAFGL 


57 


M 


AF073989.1 


547 ... 
1515 


910 


OR2E1 


8 


6 


32.05 


AHACCTINLQI . RRRRR 




M 




106872 
105934 


911 


OR1I1 


0 


19 


17.87 


MHGTSAIQIHLIFGVGS 


57 


R 


AF091566.1 


1 ... 663 


912 


ORlRnP 


3 


17 


3. 12 


MVGI S AVHLHLI EGWA 


45 


R 


M64377.1 


1 ... 939 


913 


OR4F3 


0 


8 


0.07 


IHGGMVLHFQFVNSICG 


D 1 






1 ... 906 


914 


OR2AEn 


0 


7 


98.7 


HLGGCMGNIHIVSSLLL 


49 


M 


AC073769.1 


143294 
142353 


915 


OR2InP 


7 






TTTTTMARILL 


72 


M 


AL078630.1 


151152 
150391 


916 


OR52An 
p 


2 






IHSASVRFPLLGXPPPP 


94 


R 


AF079864 . 1 


632 ... 
1576 




OR7P1 


c 


19 




ITGCNGIGLETIATLGI 


81 


R 


AF091580.1 


7 ... 663 


918 


OR2A3P 




7 


149.13 


. MLAACTCLINLVGGVLG 


63 


M 


AF102521.1 


22 . . . 
669 


919 


OR7A5 


c 


) IS 


)| 


MIAGNAMYLQMITVLGb 


1 A 


M 


AF283558 . 1 


1 ... 927 


920 


OR2InP 








KJI 7\ T3 T T T 


67 


M 


AL078630. 1 


151152 
150391 


921 


OR7A10 


( 


D 15 


)\ 


MLVGNAMNLQMMAVLGG 


76 


R 


M64376.1 


1 ... 999 




OR9 An 


( 


] 






81 


M 


AF102521.1 


22 . . . 
669 


923 


OR2M2 




D 




IISGCFLDIDAICCMLF 


57 


M 


AF102537.1 


16 . . . 

669 
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SEQ 
ID # 


Symbol 


D 


C I 
( 


4b C 
^oord 


:dr 


% 


S 


Acc 


Range 


924 ( 


DR7A8P 


2 


19 


1 


4LAVSSLNLQMIATLGG 


71 


M 


AF283558.1 


1 ... 927 


925 < 


DR2An 


0 






PCM7PTTT TUT 


78 


M 


L14566. 1 


62 . . . 
667 


926 < 


DR7E20 
P 


4 






MAGbLLLc ijHirirAr kjLj 


56 


M 


AF073989. 1 


547 ... 
1515 


927 


DR2AnP 


3 






rpT T\ lir'TrT T 


65 


M 


AF102521.1 


22 . . . 
669 


928 


OR5BHn 
P 


7 








34 


M 


Y15525.1 


1 . . . 705 


929 


ORlEn 


0 






LMGDSLLHLHLIMGISI 


92 


M 


AC068902.1 
i 

J- 


196434 
195499 


930 


ORlEnP 


1 






. . _ _w v » tit TIT T TPUUT 

MLGDSLLHLHLI IGVVL 


QO 

yo 


M 




32 ... 
649 


931 


OR5Bn 


0 


11 


54 .45 


FVITSGCNIHNI WNUr 


D 1 






34 ... 

978 


932 


OR8RnP 


12 


11 


73.74 




52 


M 


AC069561.1 
0 


7848 . . . 
8783 


933 


OR5ANn 


0 


11 


55.69 


YSGLSGTAFQATLTFGA 


55 


R 


AF091564.1 


7 ... 663 


934 


OR5ANn 
P 


1 


11 


55. 69 


YSGLCGTGIQATLTFGT 




M 




1 ... 705 


935 


OR5BRn 
p 


8 


11 


55. 69 


MSNVCGTVIQATLTFGT 




M 


Y*l ^9 S 1 


1 . . . 705 


936 


OR2A1 


0 


7 


149.18 


TLGHCTCLAHLIACFLG 


77 


M 


AF102521.1 


22 . . . 
669 


937 


ORlOAn 


0 


11 


6.81 


MLGGC FLLVQWAGT 1 1 V 


54 


M 


AF247657.1 


1 ... 945 


938 


OR2A9 


3 


7 


149. 18 


TLAHCTCLVHLIACILG 


7ft 


M 


Arin?52i l 

/*vr x <l -J j. . j- 


22 ... 
669 


939 


OR2A7 


0 


7 


149. 18 


TSAVCTTLIHLVGAGLG 


81 


M 


L14566. 1 


62 . . . 
667 


940 


OR10A3 


0 


11 


6.81 


MLGGC FSWQWAGT I V V 


D D 


M 


AF247657 1 


1 . . . 945 


941 


ORlOCn 


c 


6 


33. 36 


MLGACSCVGHF1A1 LIU 


D y 


M 


AL365336 1 


122764 
121784 


942 


OR7A2P 


c 


) IE 




M V I V S VMN LQVMAAL DG 


73 


M 


AF283558.1 


1 ... 927 


943 


ORlOWn 
P 




! 11 


54.3 


MIGSCASLQLFVAAAIV 


47 


M 


AC012302.5 


54283 ... 
55224 


944 


OR7A17 


c 


) li 


) 


MVGGSAINSQMMAALAG 


76 


M 


AF283558. 1 


1 ... 927 


_7 *i 


OR5Bn 


( 


) 1] 


54 .2 


MAATNGINIQDLISNVF 


47 


M 


AF102528.1 


52 . . . 
669 


946 


OR5BnP 


t 


3 11 


l 54.2 


MVATNGCNLRDLMSNVL 


47 


M 


AF102528.1 


52 . . . 

669 



WO 01/27158 



PCT/US00/27582 



SEQ 

ID # 


Symbol 


D 


C 


Mb 

coord 


CDR 


% 


S 


Acc 


Range 


947 


OR1Q1 


0 


9 


106.13 


TIAVNMLHLHLIEGVIG 


54 


M 


AF073967.1 


2 ... 649 


948 


OR2Hn 


0 


6 


33.33 


LLGTCVMQVQSLSSFW 


88 


M 


AL078630.1 


48786 . . . 
47851 


94 9 


OR7EnP 


5 


3 


90.04 


MVACDVLDLHIIDSFGL 


54 


M 


AF073989.1 


547 ... 
1515 


950 


OR7A14 


0 


*19 


17.72 




71 


M 


AC073772.1 


227187 
226252 


951 


OR1B1 


0 


9 


106.13 


FYGVTLVHLRLIEGLMG 


49 


M 


AC068902.1 
1 


83719 ... 
84647 


952 


OR12D2 


0 


6 


33.23 


LHGS ST IHLHMLVTI AG 


81 


M 


AL359381.1 


105330 
104407 


953 


OR7EnP 


4 


3 


11.92 


MVAC DVLDLH 1 1 DS FGL 


55 


M 


AF073989.1 


547 ... 
1515 


954 


OR8BnP 


5 


15 


74 .31 


LXVVEGMGAHCWVNIV 


82 


M 


AC069559.8 


96224 ... 
95292 


955 


OR1L1 


0 


9 


106. 13 


MLGNSLIHLHLVEGVIT 


57 


M 


AC023167.7 


60743 ... 
61663 


956 


ORllAn 


0 


6 


33.36 


FGATCTSVLVLTLSCLI 


76 


M 


AL359381.1 


175785 
176720 


957 


OR7AnP 


4 


12 


44 .29 


HLLDCYIRTTLSG 


55 


M 


AF102534 .1 


52 . . . 
669 


958 


OR1C1 


0 


1 


254 .35 


LWNSGVHLHLIVGLAT 


56 


M 


AC073769.1 


133488 
132556 


959 


OR1D2 


0 


17 


2.99 


LVVANLLYIHLLTGIFI 


50 


M 


AF073967.1 


2 ... 649 


960 


OR1L3 


0 


9 


106.13 


MLGNSFFHLHLAEGSVA 


53 


M 


AC023167.7 


14677 ... 
15636 


961 


OR12Dn 
P 


1 


6 


33.36 


LHGSATIHLHMSTGIAG 


76 


M 


AL359381.1 


105330 
104407 


962 


OR4G1P 


4 


16 


83.04 


KHGGMAIHSQFVNSISG 


47 


M 


AB030896.1 


1 ... 906 


963 


OR2B4P 


1 


6 


33.53 


LLGSCGSNVQLLLGLLM 


90 


M 


AL359352.1 


95024 . . . 
95965 


964 


OR11H1 


0 


22 




FFGTCLCWIPLCLSVIG 


61 


M 


AC027184.3 


54955 ... 
54017 


965 


OR4 Fn 


0 


16 


83.04 


IHGGMVIHSQFVNSLTC 


DU 


M 




D C -J *J ... 

61317 


966 


OR56An 
P 


5 


11 


4.73 


MNLPSFQLPVLQAGFLS 


38 


M 


AF121975.1 


50 . . . 
1012 



138 
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PCT/US00/27582 



SEQ 
ID # 


Symbol 


D 


C 


to 

cooircl , 


CDR 


% 


S 


Acc 


Range 


967 


0R8NnP 


I 


A 

H 


I D *j . i J 


RF T I RVDAFLKKTANMI 


34 


M 


AF102528.1 


52 . . . 
669 


968 


OR7EnP 


b 






MVACDVLDLHIFFDFGL 


54 


R 


AF091580.1 


7 ... 663 


969 


OR4Pn 


0 


11 


50.95 


LHGGIVGHSQLVNSIAV 


56 


M 


AB030895.1 


1 ... 924 


970 


OR6Cn 


0 






LIGVFCSTPPLGFATLF 


51 


M 


NM 010991. 
1 


1 ... 939 


971 


OR5BCn 
P 


2 


11 


54.3 


GCQIHFLLANIF 


41 


M 


AC069561.1 
0 


51687 . . . 
50743 


972 


ORlOQn 
P 


4 


11 


54.3 


MLGGCGLLQLLLVSVLV 


48 


M 


AC012302.5 


54283 ... 
55224 


973 


OR5BnP 


6 


11 


54.3 


TDASNGGNIHELVTNIF 


45 


R 


U50948.1 


34 . . . 

978 


97 4 


ORlOPn 
P 


2 


12 


115.61 


MIGICTTTTHLVATFI I 


46 


M 


AF247657.1 


1 ... 945 


97 5 


OR1L4 


0 


9 


106.22 


mrMr^M C r* T U TTD T \/FT\7TT 


62 


M 


AF073967 . 1 


2 ... 649 


976 


OR2APn 
P 


3 


12 


115.61 




49 


M 


AF073987.1 


2 ... 649 


977 


OR1L6 


0 


9 


106.22 


MMGNSG I H FRLVETV I T 


63 


M 


AF073967.1 


2 ... 649 


97 8 


OR6UnP 


6 


12 


115.61 


DIGAFTLFMPLDLAALG 


52 


M 


NM 010991. 
1 


1 ... 939 


97 9 


OR5C1 


0 


9 


106.06 


MAADCAGSVHLLICIQA 


50 


R 


X80671.1 


203 ... 
1129 


980 


ORllIn 
P 


1 


15 


70.72 


FGAACGCLITLATSVTI 


51 


M 


AL359381.1 


175785 
176720 


981 


OR4AnP 


6 


11 


50 . 78 


t vrrwururnvvwnvru 
I, i kjoV Von tyv v r*o v v 


57 


M 


AB030896 . 1 


1 ... 906 


982 


OR4GnP 


14 


2 


114.4 5 




4 2 


M 


AB030892 . 1 


1 ... 939 


983 


ORlOVn 


0 


11 


56.15 


MVGGCGLLPLLLISVLI 


48 


M 


AL136158.1 
4 


29455 . . . 
30402 


984 


OR4G2P 


2 


2 


114 .45 


KHGGMAIHSQFVNSISG 


48 


M 


AB030896.1 


1 ... 906 


985 


ORlOVn 
P 


3 


11 


56.15 


MI GRCGLLQLLMVS FLV 


45 


M 


X92969.1 


8035 . . . 
8961 


986 


OR4F4 


0 


2 


114.45 


IHGGMVIHSQFVNSLTC 


50 


M 


AC019272.4 


62255 . . . 
61317 


987 


OR4G3P 


14 


19 


63.51 


I CRKMAVHSQFVNS I SA 


42 


M 


AB030892.1 


1 ... 939 


988 


OR5AKn 
P 


4 


11 


52.82 


LGATCSMNINFLFVNLC 


65 


R 


U50948.1 


34 ... 

978 


989 


ORlOYn 
P 


14 


11 


56.15 


MIRGCGLLFLLLCGHHL 


43 


M 


AF247657.1 


1 ... 945 


990 


OR4GnP 


2 


19 


63.51 


KHGGMAIHSQFVNSISG 


48 


M 


AB030896.1 


1 ... 906 



139 
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SEQ 
ID # 


Symbol 


D 


c 


Mb 

coord 


CDR 




b 


Acc 


Rflncjs 


991 


ORnP 


9 


5 


111 . 92 


IMCSRTTYVXQLHGFFT 


Z J 


M 


At u / j?oy. i 


D *i / ... 

1515 


992 


OR4 Fn 


0 


19 


63.51 


I HGGMVI HSQFVNSLTC 


5U 


M 


npfll GOT) A 


cooct 

J J ... 

61317 


993 


OR8A1 


0 


11 


137.56 


LLVICVIGIELVSANIV 


61 


M 


AC069559.8 


96224 ... 


994 


OR8Bn 


0 


11 


137.56 


LCWSGMGAHSVWDVM 


66 


M 


AC069559.8 


120212 
119283 


995 


OR6DnP 


3 


10 


47.91 




55 


R 


AF034 901 . 1 


Z 1 1U ... 
3078 


996 


OR7E14 
P 


7 


11 


16.31 


MAGGELLDLHIMPAFGL 


58 


R 


AF091580 . 1 


7 ... bbj 


997 


OR2M4 


0 






IVLGCALDIVALCCMLF 


57 


M 


AF102537 . 1 


16 ... 
669 ! 


998 


OR4WnP 


3 


X 




LLLLL LLFFII 


36 


M 


AC069559 . 8 


73704 . . . 
74636 


999 


OR4Fn 


0 


19 


63.51 


I HGGMVI HSQFVNSLTC 


50 


M 


AC019272 . 4 


62255 . . . 
61317 


1000 


OR7EnP 


3 






MAGGESLDLHIMPAFGL 


57 


M 


AF073989.1 


547 ... 
1515 


1001 


OR4GnP 


4 


19 


63.51 


KHGGMAIHSQFVNSISG 


47 


M 


AB030896.1 


1 ... 906 


1002 


ORlOJn 
P 


1 






LLGVCGITIQSTISVLL 


60 


M 


X92969. 1 


8035 . . . 
8961 


1003 


OR52En 


0 


11 


4.58 


MHTASIRMPLLGNILLL 


71 


M 


AF121979. 1 


53 ... 
1106 


1004 


OR4RnP 


24 


11 




VHGAIMGHVXSFANNCL 


54 


M 


AF102522 . 1 


40 ... 
660 


1005 


OR4Cn 


0 


11 




AHGAIVGHIQFVNSICL 


75 


M 


AF102522 . 1 


40 ... 
660 


1006 


OR4AnP 


10 


11 




GLGGIVGHIQL 


44 


M 


AF102522.1 


40 . . . 
660 


1007 


OR4AnP 


4 


11 




LHGGVAGHFQWNGGCI 


55 


M 


AB030895. 1 


1 ... 924 


1008 


OR4AnP 


8 


11 




LHGGVAGHSHSVNGICV 


54 


M 


AF102522.1 


40 . . . 

bbU 


1009 


OR9Gn 


0 


11 


52.54 


FAAYCVGN I I KMLLNVC 


4 6 


M 


AC07 4 1 / / . 4 


105361 


1010 


ORlOAn 


0 


12 


59.65 


MFGSCGSVLQWASTFIF 


64 


M 


AF247657.1 


1 ... 945 


1011 


OR4Cn 


0 


11 




VHRGWGHIQFINSICL 


73 


M 


AF102522.1 


40 . . . 

660 



140 



WO 01/27158 
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SEQ 
ID # 


Symbol 


D 


C 


Mb 

coord 


CDR 


% 


S 


Acc 


Range 


1012 


ORlOVn 
P 


8 


11 


56.15 


. FFFFIIXNEXSVWLV 


37 


M 


AC073945.4 


110931 
111893 


1013 


ORlOUn 
P 


3 


12 


03 . 00 




56 


R 


AF034898 . 1 


1 . . . 981 


1014 


OR7E2P 


3 


11 


90 . 37 




SQ 

»j ^ 


M 


AF073989 1 


547 . . . 
1515 


1015 


OR7E35 
P 


6 


4 


11.87 


MAGGEFLDLHIVPAFVL 


53 


M 


AF102536.1 


22 . . . 
669 


1016 


OR9KnP 


0 


12 


59.71 


LAIVGGCSLQVSLSIIP 


49 


R 


AF091579.1 


7 ... 663 


1017 


OR7E13 
P 


5 


11 


90 . 37 


MAGGEFLDLHIMLAfc GL 


04 


D 

n 


nt U27XOOU. X 


7 ... 663 


1018 


OR7EnP 


4 


8 


6.5 


MLACGVLDLH 1 1 DS FGL 


£ c: 
33 


M 

M 




. » > 

669 


1019 


OR9Kn 


0 


12 


59.71 


LAIVGGCSIQMSLSIIP 




lur 

M 


1 " 


X ... .7 ** U 


1020 


ORnP 


13 


11 


137 .56 


PCVI YGIDVHSLXEPAY 




M 

M 




JOt Ji ... 

35322 


1021 


OR7EnP 


8 


11 


72.11 


MAGGNLFFSLLMPAFGL 


C A 

34 


mt 
M 


/\t u / j y o y . i 


S4 7 

1515 


1022 


OR7EnP 


5 


3 


140.64 


MAGGKFLDLHIMPAFGL 


53 


M 


AF073989. 1 


547 ... 
IRIS 


1023 


OR3A4P 


0 


17 


3.12 


LHAGCMFNTQALAAMGA 


44 


M 


AC073769.1 


133488 
1 "*?SS6 


1024 


OR8QnP 


9 


11 


137.56 


LSIIIVETEFVFTXIVT 


33 


M 


AC069559.8 


137090 
138039 


1025 


OR7EnP 


2 


11 


72.11 


ILACGVLDLHIMHNr GL 


0 0 


M 


2\ TTPl 7 "3 Q£ Q 1 
/it U / . i. 


Sd 1 
1515 


1026 


OR7EnP 


3 


3 


140. 64 


MVACG VLDLH 1 1 H S FGL 


56 


M 


AF073989.1 


547 ... 
1515 


1027 


OR3A1 


0 


17 


3.07 


LHVGCACNTHALVGMAT 


50 


M 


AF073967.1 


2 ... 649 


1028 


OR5Gn 


0 


11 


52.52 


MGEACGMSTHFLLAIGL 


69 


M 


AF146372.1 


509 ... 
1456 


1029 


OR5MnP 


7 


4 


42.45 


LI I I YVYNAQRI I IMLE 


39 


M 


AF073987.1 


2 ... 649 


1030 


OR7EnP 


1 


3 


136.02 




Sd 


M 


AF073989 1 


547 . . . 

1515 


1 0^1 

-L V -J -L 


OR5G1 P 


2 


11 


52.51 


QGVACG INTHNVVAVG F 


68 


M 


AF146372. 1 


509 ... 
1456 


1032 


OR5PnP 


3 


11 


6.93 


LVGTCAGNSFCPSSVLS 


70 


M 


AF121977.1 


262 ... 
1197 
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SEQ 
ID # 


Symbol 


D 


C 


Mb 

coord 


CDR 


% 


S 


Acc 


Range 


1033 


OR10AE 
nP 


8 


1 


157.36 


IIIIIGIMVIVQIHCVV 


40 


M 


X92969.1 


8035 . . . 


1034 


OR3A2 


0 


17 


3.07 


LHAGCACNTHALVGMAT 


50 


M 


AC073769.1 


133488 
132556 


1035 


ORlOJn 


0 


1 


157.4 


MVATCGIMLHANVSVIV 


88 


M 


X92969.1 


8035 . . . 
8961 






2 


17 


2 . 94 


LVVANLFYIHLLTGIFI 


50 


R 


Y07557.1 


1 ... 942 


1037 


ORlOJn 


0 


1 


157.36 


TVAICGIMVQSNVRVIV 


72 


M 


X92969.1 


8035 . . . 
8961 


IUjO 




n 


17 


2 . 99 


LWTNLLYLLLLTGI FT 


49 


R 


Y07557.1 


1 ... 942 


1039 


OR5GnP 


8 


11 


52.51 


QGVV Y VAN THAWAVLV 


55 


M 


NM 013728. 
1 


1 ... 948 




OR KnP 
vjrvfl our 


i 

j. 




50 . 99 


LHGC I GG H I QLVN SI AG 


61 


M 


AB030895.1 


1 ... 924 


1041 


OR5GnP 


4 


11 


52.51 


LGWCGVSTHFLLVLGL 


75 


M 


AF146372.1 


509 ... 
1456 


1042 


OR9HnP 


2 


1 


254 .35 


FSGI AGWNAQMLLCI I S 


59 


R 


AF091579.1 


7 ... 663 


1043 


OR1A1 


0 


17 


2.99 


MIGNSGINPHLMGVIFV 


86 


M 


AF073966.1 


41 ... 
643 


1044 


OR1A2 


0 


17 


2.99 


MIAKSGISPHLMLGVFL 


80 


M 


AF073966.1 


41 . . . 

643 


1045 


OR8AnP 


6 


11 


137.68 


FL V I C VMV I E LVFANL I 


50 


M 


AC069561.1 
0 


51687 . . . 
50743 


1046 


OR1P1P 


1 


17 


2.99 


LLGDIALLTRLLLGVI I 


82 


M 


AF102538. 1 


139 . . . 

675 


1047 


OR7E12 
p 


7 


11 


1.92 


MAGGEFFSLHIMPAFGL 


55 


M 


AF073989.1 


547 . . . 
1515 


1048 


OR4A1P 


4 


11 




LHGGWGH FQVVNGICV 


57 


M 


AB030896.1 


1 ... 906 


1049 


OR10G3 


0 


14 


1.7 


LHGSCGAHLQLTDIVVS 


91 


M 


AF259072.1 


19582 . . . 
18644 


1050 


OR10G1 
P 


3 


14 


1.7 


LHGSCGAHIQLTDIVAS 


93 


M 


AF259072 . 1 


55611 . . . 
54658 


1051 


OR10G2 


0 


14 


1.7 


LHGSCGAHIQLTDWAS 


91 


M 


AF259072. 1 


55611 . . . 
54658 


1052 


OR5Tn 


0 


11 


51.94 


MVGTCAAH IHALFVIEV 


52 


M 


AF121977.1 


262 ... 
1197 


1053 


OR7EnP 


8 


3 


136.02 


MVACGVLDLHIIGSFGL 


53 


R 


AF091580.1 


7 ... 663 


1054 


OR7EnP 


5 


3 


136.02 


MAGGKFLDLHIMPAFGL 


54 


M 


AF073989.1 


547 ... 
1515 


1055 


0R4AnP 


2 


11 


50.93 


LHAGWGHVQFMNGICV 


61 


M 


AB030895.1 


1 ... 924 


1056 


OR4C1 


1 


11 


50.93 


LHGGI IGHVQFVNSMCL 


66 


M 


AB030896.1 


1 ... 906 
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SEQ 
ID # 


Symbol 


D 


c 


Mb 

coord 


CDR 


% 


S 


Acc 


Range 


1057 


ORlEnP 


7 


17 


2.9 


. MMMYTLIMGILI 


80 


M 


AF073961.1 


32 . . . 
649 


1058 


OR7KnP 


11 


14 


5. 99 


MIGCNFIELYMMIGIFG 


49 


R 


AF091580.1 


7 ... 663 


1059 


OR4CnP 


3 


11 


50.93 


LHDGIEGHIQFVNSMCA 


61 


M 


AF102522.1 


40 . . . 
660 


1060 


ORlRnP 


11 


17 


2.9 


MVGISAVHLHLIEGWA 


44 


R 


M64377.1 


1 ... 939 


1061 


OR5AUn 


0 


14 


1.22 


MAATCGANIHCLFANLS 


51 


M 


AC069559. 8 


85584 . . . 
84655 


1062 


OR4Cn 


0 


11 


50. 96 


LHAGWGHIQFVNSICI 


69 


M 


AF102522.1 


40 . . . 

660 i 


1063 


OR4Cn 


0 


11 


50.96 


VHGCIVGHVQLLNSICV 


57 


M 


AB030895.1 


1 ... 924 


1064 


OR13Dn 
P 


2 


9 


86.89 


MLGSCWITLRLFTVIVL 


58 


M 


AJ251154.1 


2703 . . . 
1747 


1065 


OR5n 








ASASLTSYVHNEEEVFV 


44 


M 


AL359352.1 


111313 
112242 


1066 


OR2Hn 








LLGTCVMQVQSLSSLW 


83 


M 


AL078630. 1 


48786 . . . 
47851 


1067 


ORn 










25 


M 


AC074177.4 


88434 . . . 
88916 


1068 


ORn 








EINLLLARGKAL 


29 


M 


AF283814 . 1 


1 ... 930 


1069 


ORn 








NNNNNFXSLHLCCCILI 


29 


M 


AC074177.4 


128803 
129726 


1070 


ORn 








TLLLLTFQHHL 


27 


M 


L14569. 1 


62 . . . 
667 


1071 


OR6Fn 








. .CCCWPIPTSAIAVIS 


46 


R 


M64386. 1 


130 . . . 
975 












ILLLLL 


33 


R 


U50947.1 


418 ... 
1350 


1073 


ORn 








. .CCCLIPFFFTSGYSW 


24 


R 


M64392. 1 


1 ... 942 


107 4 


OP 1 OAn 








PLGECDPEEQMYVGLVM 


51 


M 


AF247657.1 


1 ... 945 


1075 


ORn 








IPNASRRRRRR PP 


25 


R I 


M64388. 1 


1 ... 942 


1076 


OR2Ln 








FLAG AG I NAH Y VS T FL F 


51 


M 


AF102527 . 1 


22 ... 
669 


1077 


ORlOJn 








LTGICGIMVQSNVSVLL 


57 


M 


X92969.1 


8035 ... 
8961 


1078 


ORlKn 








LLLLLMVNL YL I KG WT 


50 


R 


M64377.1 


1 ... 939 


1079 


ORlODn 








LHGSCGLHILLSNVISG 


69 


M 


AC074177.4 


12106 . . . 
13038 


1080 


ORn 








CCCIII 


41 


R 


M64376.1 


1 ... 999 
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ID # 




u 


Q 


Mb 

coord 


CDR 


% 


s 


Acc 


Range 


1081 


0R2Ln 








SLACGGLNAH FVRTLS F 


52 


M 


AF102537.1 


16 . . . 
669 


1082 


ORn 








HHHHHRLESSSLLLLLL 


38 


M 


AC073945.4 


152209 
153150 


1083 


ORn 








LLLLLS 


27 


M 


AL365336.1 


41087 ... 
41711 


1084 


0R2n 










57 


M 


AF102521.1 


22 . . . 
669 



5 Although the foregoing invention has been described in some detail by way of 

illustration and example for purposes of clarity of understanding, it will be apparent to 
those skilled in the art that various changes and modifications can be practiced without 
departing from the spirit of the invention. Therefore the foregoing descriptions and 
examples should not be construed as limiting the scope of the invention. 

10 

All patents, patent applications, and publications cited herein are hereby incorporated 
by reference in their entirety. In particular, the following documents are hereby incorporated 
by reference in their entirety: United States Provisional Patent Applications Serial 
Nos. 60/145,412, filed July 23, 1999; 60/155,126, filed September 22, 1999; 60/158,495, 
15 filed October 8, 1999; 60/158,615, filed October 8, 1999; 60/181,1 13, filed February 8, 
2000; 60/181,1 15, filed February 8, 2000; 60/184,809, filed February 24, 2000; 
60/188,332, filed March 9, 2000; and United States Patent Applications Serial 
Nos. 09/620,753, filed July 21, 2000; and 09/621,122, filed July 21, 2000. 
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CLAIMS 

What is claimed is: 

5 1 An isolated and purified polynucleotide sequence encoding an olfactory 

receptor and having the nucleotide sequence selected from the group consisting of SEQ ID 
NO:l through SEQ ID NO: 73 and SEQ ID NO: 1 1 1 through SEQ ID NO: 1 52, or a 
nucleotide sequence that is at least about 95% homologous to a nucleotide sequence of the 
group consisting of SEQ ID NO:l through SEQ ID NO: 73 and SEQ ID NO:l 1 1 through 

1 0 SEQ ID NO: 1 52 and encoding a polypeptide having olfactory receptor function. 

2. An expression vector comprising a polynucleotide sequence of claim 1 . 

3 . A host cell comprising the expression vector of claim 2. 

15 

4. An isolated and purified olfactory receptor polypeptide comprising the 
translated sequence of SEQ ID NO: 1 through SEQ ID NO: 73 and SEQ ID NO: 1 1 1 
through SEQ ID NO: 152, or a polypeptide sequence that is at least about 95% homologous 
to a polypeptide sequence of the group consisting of the translated sequence of SEQ ID 

20 NO: 1 through SEQ ID NO: 73 and SEQ ID NO: 1 1 1 through SEQ ID NO: 1 52 and having 
olfactory receptor function. 

5. A host cell expressing a polypeptide of claim 4 or a functional fragment 

thereof. 

25 

6. A phage expressing a polypeptide of claim 4 or a functional fragment 

thereof. 

7. A preparation containing a polypeptide of claim 4, further comprising 
30 biological or synthetic molecules which maintain the functional structure of the 

polypeptide. 
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8. An isolated and purified polynucleotide sequence encoding an olfactory 
receptor and having the nucleotide sequence selected from the group consisting of SEQ ID 
NO: 153 through SEQ ID NO: 1084 or a nucleotide sequence having a sequence at least 
about 95% homologous to a nucleotide sequence of the group consisting of SEQ ID NO: 

5 1 53 through SEQ ID NO: 1 084 and encoding a polypeptide having olfactory receptor 
function. 

9. An expression vector comprising a polynucleotide sequence of claim 8. 
10 1 0. A host cell comprising the expression vector of claim 9. 

11. An isolated and purified olfactory receptor polypeptide comprising the 
sequence of SEQ ID NO: 1085 through SEQ ID NO: 2008, or a polypeptide sequence that 
is at least about 95% homologous to a polypeptide sequence of the group consisting of SEQ 

1 5 ID NO: 1 085 through SEQ ID NO: 2008 and having olfactory receptor function. 

12. A host cell expressing a polypeptide of claim 1 1 or a functional fragment 

thereof. 

20 1 3. A phage expressing a polypeptide of claim 1 1 or a functional fragment 

thereof. 

1 4. A preparation containing a polypeptide of claim 1 1 , further comprising 
biological or synthetic molecules which maintain the functional structure of the 

25 polypeptide. 

15. A library of olfactory receptors suitable for determining the interaction 
pattern of a composition with the receptors, comprising the expression products of at least 
two polynucleotides of SEQ ID NO: 1 through SEQ ID NO: 73, SEQ ID NO:l 1 1 through 

30 SEQ ID NO: 1 52, and SEQ ID NO: 1 53 through SEQ ID NO: 1 084 wherein said 

polynucleotides encode functional olfactory receptors; or functional fragments of said 
expression products. 
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16. A library of olfactory receptors according to claim 1 5, wherein the library 
comprises the expression products of at least 50 polynucleotides of SEQ ID NO:l through 
SEQ ID NO: 73, SEQ ID NO:l 1 1 through SEQ ID NO: 152, and SEQ ID NO: 153 through 
SEQ ID NO: 1084 wherein said polynucleotides encode functional olfactory receptors; or 
functional fragments of said expression products. 

17. A library of olfactory receptors according to claim 1 5, wherein the library 
comprises the expression products of at least 100 polynucleotides of SEQ ID NO:l through 
SEQ ID NO: 73, SEQ ID NO:l 1 1 through SEQ ID NO:152, and SEQ ID NO: 153 through 
SEQ ID NO: 1084 wherein said polynucleotides encode functional olfactory receptors; or 
functional fragments of said expression products. 

18. A library of olfactory receptors according to claim 1 5, wherein the library 
comprises the expression products of at least 200 polynucleotides of SEQ ID NO:l through 
SEQ ID NO: 73, SEQ ID NO.i 1 1 through SEQ ID NO:152, and SEQ ID NO: 1 53 through 
SEQ ID NO: 1084 wherein said polynucleotides encode functional olfactory receptors; or 
functional fragments of said expression products. 

19. A library of olfactory receptors according to claim 1 5, wherein the library 
comprises the expression products of at least 500 polynucleotides of SEQ ID NO:l through 
SEQ ID NO: 73, SEQ ID NO:l 1 1 through SEQ ID NO: 152, and SEQ ID NO: 153 through 
SEQ ID NO: 1084 wherein said polynucleotides encode functional olfactory receptors; or 
functional fragments of said expression products. 

20. A library of olfactory receptors suitable for determining the interaction 
pattern of a composition with the receptors, comprising at least two polypeptides of SEQ 
ID NO: 1085 through SEQ ID NO: 2008, wherein said polypeptides are functional 
olfactory receptors; or functional fragments of said polypeptides. 

21. A library of olfactory receptors according to claim 20, wherein the library 
comprises at least 50 polypeptides of SEQ ID NO: 1085 through SEQ ID NO: 2008, 
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wherein said polypeptides are functional olfactory receptors; or functional fragments of 
said polypeptides. 

22. A library of olfactory receptors according to claim 20, wherein the library 
5 comprises at least 100 polypeptides of SEQ ID NO: 1085 through SEQ ID NO: 2008, 

wherein said polypeptides are functional olfactory receptors; or functional fragments of 
said polypeptides. 

23. A library of olfactory receptors according to claim 20, wherein the library 
10 comprises at least 200 polypeptides of SEQ ID NOS of SEQ ID NO: 1085 through SEQ 

ID NO: 2008, wherein said polypeptides are functional olfactory receptors; or functional 
fragments of said polypeptides. 

24. A library of olfactory receptors according to claim 20, wherein the library 
15 comprises at least 500 polypeptides of SEQ ID NO: 1085 through SEQ ID NO: 2008, 

wherein said polypeptides are functional olfactory receptors; or functional fragments of 
said polypeptides. 

25. A method for determining the binding pattern of a composition with 
20 olfactory receptors, comprising the steps of: 

exposing the composition to a library according to claim 21 ; and 
determining whether the composition binds to each olfactory receptor, thereby 
determining the overall binding patter of the composition. 

25 26. The method of claim 25, wherein the composition consists essentially of one 

compound or chemical. 

27. The method of claim 25, wherein the composition comprises at least two 
compounds or chemicals. 

30 

28. The method of claim 25, wherein the step of determining whether the 
composition binds to each olfactory receptor further comprises a determination of the 
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10 



approximate binding constant with which the composition binds to each receptor or 
functional fragment thereof. 

29. The method of claim 25, further comprising the step of determining whether 
a receptor or functional fragment thereof to which the composition binds is activated. 

30. The method of claim 29, father comprising the step of determining the 
absolute or relative amount by which the receptor or funcuonal fragment thereof is 
activated. 

31. a DNA array or a DNA chip comprising DNA segments derived from SEQ ID 
NO: 153 through SEQ ID NO: 1084. 

32. A method of determining differences among individuals with respect to their 
15 olfactory faculties, comprising the steps of comparing the olfactory DNA of the individual 

against the array or chip of claim 3 1 . 

33. A method to determine single nucleotide polymorphisms in olfactory receptors, 
comprising the steps of uniquely amplifying olfactory receptor sequences from DNA 

20 obtained from one or more individuals, based on primers designed according to the first 25 
bases and the last 25 bases of any combination of, or each of, SEQ ID NO: 153 through 
SEQ ID NO: 1084, and determining the similarities and differences between said amplified 
DNA and the corresponding receptor from SEQ ID NO: 153 through SEQ ID NO: 1084. 
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atgatggtgg atcccaatgg caatgaatcc agtgctacat acttcatcct aataggcctc 
ccLgtttag aagaggctca gttctggttg gccttcccat tgtgctccct ctaccttatt 
gctgStag gtLStgac aatcatctac attgtgcgga ctgagcacag cctgcatgag 
Jccatgtata tatttctttg catgctttca ggcattgaca tcctcatctc "cctcatcc 
atacccaaaa tgctggccat cttctggttc aattccacta ccatccagtt tgatgcttgt 
ctgctacaga tgtt^gccat ccactcctta tctggcatgg aatccacagt gctgctggcc 
atggcttSg accgcLtgt ggccatctgt cacccactgc gccatgccac agtacttacg 
ttgcctcgtg tcaccaaaat tggtgtggct gctgtggtgc ggggggctgc actgatggca 
ccccttcSg tcttcatcaa gcagctgccc ttctgccgct ccaatatcct ttcccattcc 
tactgcctac accaagatgt catgaagctg gcctgtgatg atatccgggt caatgtcjtc 
tatggcctta tcgtcatcat ctccgccatt ggcctggact cacttctcat ctccttctca 
tatctgctta ttcttaagac tgtgttgggc ttgacacgtg aagcccaggc "aggcattt 
ggcacttgcg tctctcatgt gtgtgctgtg ttcatattct atgtaccttt ""ggattg 
Jccatggtgc atcgctttag caagcggcgt gactctccgc tgcccgtcat cttggccaat 
atcta^ctgc tggttcctcc tgtgctcaac ccaattgtct atggagtgaa gacaaaggag 
attcgacagc gcatccttcg acttttccat gtggccacac acgcttcaga gccc 



600 
660 
720 
780 
840 
900 
960 
970 



ctcattgttg gtactggttt gctcagatcc ttggatcacc tcccgatctt ggtggttggc 
tgtagggga? Laccctggt cacctctgtt gtggtgatcc ttgtctccta catgtcttcc 
ScSSS tcctaggaat tcccttagct tctggaaaac ^gagcctt ctcc.cctjt 
acctcccact tgactgctgt tagcctgtac tatgaaacaa ctatgtacac ttacttgccc 
gcctcgcgac a?gga?ccgg ggcaggaaat cagattgtgt cagtatttta tacaatggtg 
atcccSgt taaatcctct catctatagt ttgagaaatg aggaagtgaa agttgcccta 
tgaaaaacat tgagacatag tccttaatct tctattgagt gtctcaaaaa tgcaaaatat 
tctgtgaaga 

<210> 407 
<211> 934 
<212> DNA 

<213> Unknown (H38g256 nucleotide) 
<220> 

<223> Synthetic construct 

CUrtB aaaaaaaaca agattctaac gtgacagaac ttgttcttct Wjcctatc. 60 
tStcttggg agctgcagct atttctctta ttactatttt tgttttttta cattgctatt 120 
gtcc?gggaa acctcttgat agtggtaaca gtgcaagccc atgctcatct ^tccaatct 180 
cctatgtatt attttttagg tcatctctct ttcattgacc tatgcctaag etgtgttaet 240 
ctgccaaaga tgttagggga tttcctacag cagggcaaga J«"tcttt "caggatgc 
ctaacccaaa tctacttcct ccactttcta ggagccagtg agatgttttt gctgacagtc jou 
atggccta?g acaggtatgt tgccatctgt aaccctttgc gctaccttat aagtcatgaa 420 
^ccccagSa tgcctttggt tggttcttgc ctgctggtgt gw^tttt. ^ccactctat 48 
catgcaggtc atactagtca tccagctgcc tttctgtggc cccaatgaac tggacaactt 540 
ctactgtgat gtcccacagg tcatcaagct ggcctgcatg gacacctatg tggtagaggt 600 
gctgaSta gccaacagtg gtctgctctc tcttgtctgc ttcttggtct tactattctc 660 
tta?gc?gtc atcctgatca ccctgagaac acacttcggc cagggccaga *caagttcct 720 
ctctacctgt gcttctcacc tgacagtggt cagcctgatc ttcatgccat gtatattcat 780 
ctatttgagg cctttctgca gcttctctgt ggataagata ttctccatgt tttacacagt 840 
gatgacacc? atgttgagcc ccctcatcta cacactcaga aatgctgata tgaagacagc 900 
tatgaagaag ctgaggataa aaccatgtga catt 

<210> 408 
<211> 954 
<212> DNA 

<213> Unknown (H38g257 nucleotide) 
<220> 

<223> Synthetic construct 
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Met Gin val He Leu vll lie Gin Leu Pro Phe Cys Gly Pro Asn Glu 



150 1" 160 

He Gin Leu . 
165 170 1 

Leu Asp Asn Phe Tyr Cys Asp Val Pro Gin Val lie Lys Leu Ala Cys 



180 - « - 

Met Asp Thr Tyr Val Val Glu Val Leu Met lie Ala Asn Ser Gly Leu 



Leu Ser III Val Cys Phe Leu Val Leu Leu Phe Ser Tyr Ala Val He 



200 205 



L eu lie Thr Leu Arg Thr His Phe Gly Gin Gly Gin Asn Lys Phe Leu 



215 



220 



III Thr Cys Ala Ser hIs Leu Thr Val Val Ser Leu lie Phe Met Pro 



230 



235 240 



Cys lie Phe He Tyr Leu Arg Pro Phe Cys Ser Phe Ser Val Asp Lys 



250 



lie Phe Ser Met Phe Tyr Thr Val Met Thr Pro Met Leu Ser Pro Leu 

275 280 

He Tyr Thr Leu Arg Asn Ala Asp Met Lys Thr Ala Met Lys Lys Leu 
««« 9 q q 300 

290 z " 

Arg He Lys Pro Cys Asp He 
305 310 

<210> 1340 
<211> 318 
<212> PRT 

<213> Unknown (H38g257 protein) 
<220> 

<223> Synthetic construct 

„r M et 3 Val Asp Pro Asn Gly Asn Glu Ser Ser Ala Thr Tyr Phe He 
Leu He Gly Leu Pro Gly Leu Glu Glu IL Gin Phe Trp Leu Ala Phe 
Pro Leu Cys Ser Leu Tyr Leu lie Ala Val Leu Gly Asn Leu Thr He 
He Tyr He Val Arg Thr Glu His Ser Leu His Glu Pro Met Tyr He 
Phe Leu Cys Met Leu Ser Gly He Asp He Leu He Ser Thr Ser Ser 
Met Pro Lys Met Leu IL He Phe Tr P Phe Asn Ser Thr Thr He Gin 
Phe Asp Ala Cys Leu Leu Gin Met Phe Ala He His Ser Leu Ser Gly 



265 270 



Met Glu Ser Thr Val Leu Leu Ala Met Ala Phe Asp Arg Tyr Val Ala 



105 



120 



125 



He Cys His Pro Leu Arg His Ala Thr Val Leu Thr Leu Pro Arg Val 

13 5 

Thr Lys He Gly Val Ala Ala Val Val Arg Gly Ala Ala Leu Met Ala 

Pro Leu Pro Val Phe He Lys Gin Leu Pro Phe Cys Arg Ser Asn He 

Leu Ser His Ser Tyr Cys Leu His Gin Asp Val Met Lys Leu Ala Cys 

180 185 

Asp Asp lie Arg Val Asn Val Val Tyr Gly Leu lie Val He He Ser 



195 200 205 

Leu Lys Thr Val Leu Gly Leu Thr Arg Glu Ala Gin Ala Lys Ala Phe 

230 



Ala lie Gly Leu Asp Ser Leu Leu He Ser Phe Ser Tyr Leu Leu He 



225 
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Gly Thr Cys Val Ser His Val Cys Ala Val Phe lie Phe Tyr Val Pro 

245 250 
Phe lie Gly Leu Ser Met Val His Arg Phe Ser Lys Arg Arg Asp Ser 

260 265 
Pro Leu Pro Val lie Leu Ala Asn lie Tyr Leu Leu Val Pro Pro Val 

275 280 
Leu Asn Pro lie Val Tyr Gly Val Lys Thr Lys Glu lie Arg Gin Arg 

290 2 ^5 300 

lie Leu Arg Leu Phe His Val Ala Thr His Ala Ser Glu Pro 
305 310 315 

<210> 1341 
<211> 320 
<212> PRT 

<213> Unknown (H38g258 protein) 
<220> 

<223> Synthetic construct 

<221> VARIANT 

<222> (1)...(320) 

<223> Xaa = Any Amino Acid 

Met°Ser 3 Ser Arg Leu Met Asn Val Phe Ser Met Glu Thr lie Asn Phe 

1 5 10 

Val Ser Cys Leu lie Leu Met Gly Phe Pro Ser Ser Pro Glu Met Gin 

20 25 
Leu Leu Tyr Phe Gly Leu Phe Ser Val Ala Tyr Thr Leu Thr Pro Met 

35 40 45 

Gly Asn Ala Ala lie Val Cys Ala Val Trp Xaa Asp Gin His Leu His 

50 55 
Thr Pro Met Tyr Thr Leu Leu Gly Asn Phe Ser Leu Leu Glu He Trp 

65 70 75 80 

Tyr Val Thr Ala Thr Lys Leu Leu Ala Asn Phe Leu Ser Thr Ser Lys 

85 90 

Ser lie Ser Phe Met Ser Cys Phe Ala Gin Phe Tyr Phe Phe Ser Leu 

100 105 11 ^ 

Gly Tyr Asp Glu Gly Phe Phe Leu Cys lie Thr Ala Phe Asp Arg Tyr 

115 I 20 125 

Leu Ala lie Cys Arg Pro Leu Arg Tyr Pro Cys lie Met Thr Lys Gin 

Val Cys Thr Gly Leu lie He Phe Ala Trp Ser Cys Val Phe Val lie 

145 150 155 160 

Phe Leu Thr Leu Val He Leu lie Ser Gin Leu Ser Tyr Cys Gly Pro 

165 l" 70 175 

Asn lie lie Asn His Phe He Cys Asp Pro Val Pro Leu Lys Met Leu 

180 I 85 1 

Ser Cys Ser Glu Asp lie lie lie Thr Gin Leu lie Tyr Ser Thr Phe 

195 200 205 

Asn Ser Val Phe lie lie Gly Thr Phe Leu Phe lie Leu Cys Ser Tyr 

210 215 220 

Ala Leu Val lie Leu Ala lie lie Arg Met Pro Ser Glu Ala Gly Lys 
225 230 235 240 

Arg Lys Ala Phe Ser Thr Cys Ala Ser His Leu Ala Val Val Thr Leu 

245 250 
Phe Tyr Gly Ser lie Met Val Met Tyr Val Ser Pro Gly Ser Ala His 

260 265 270 

Pro Ala Lys Asn Glu Lys He He Thr Leu Phe Phe Ser Val He Thr 

275 280 285 

Pro Leu Cys Asn Pro Leu He Tyr Ser Leu Arg Asn Lys Glu Met Lys 



651 



17 1 1*> 
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FIGURE 1 

SEQ 'l I GGNTTATNCC NCGTTGNACT GCAGGGGNNC AACNCACAGN ACGCCCGNTG CTGAGGCTAT AAATGANCGG 
71 NTTAAGGAGA GGAGTGAAGA CAGTAAAAAA ACACAGAGAT AAATTTATCA ATTGGGAAGC TTTCAAAGGG 
141 CCAAATATAG ATGAATATTA ATGGGCCAAA GAAGAGAAGC ACAACAGTAA TGTGGGCAGA CAGAGTGGAA 
211 AGGGCCTTGG ACATCCCATC AGAGGCTTGG CGATGCACAG TAGCAAGGAT GATAGTGTCA GAAATGAGCA 
281 AAAGGAGGAA ACACATAAGT GAGAGCAGAC CACTGTTAGT GAGCACCAGT ATCTCAAAAC CATAGGTGTC 
351 TAAGCAGGCA AGCTTGATCA CTAGGAGGAG GTCACAGAAA AAATTGTCTA CCCTGTTGGG TCCACAGAAA 
4 21 GGCAGATTGA CTTTGAATGC CAGGTGGGTG GCTGAGTGTG AGATGCCAAT GGCCCAGGAA ACCCCCACCA 
4 91 GAACAGTTCA CACCCTCCGG TTCATGATGG TTATGTAGTG CAGAGGTTTG CATATAGCAA TGTATCTATC 
561 ATAGGCCATG GCAACAAGAA GCACCATCTC ACTACCCCCA AAAACATGCA AGN 

SEQ 'l I GGNNTNTNAC ACGGACTCCA AGCAGTGGTA ACAACGCAGA GTACGCCCGT TCCTGAGTGA GTAGATGAAG 
71 GGGTTCAGCA TGGGATTGAT GACAGTGTTG AAAATTCCAA CAGCTTTATC CTTGTCTGAA AGCTTGGTTG 
141 AACCCAGTCG CAT AT AG TT A AAGATACCTG AACCATAGAA TATGGCAACC ACAGTGAGGT GGGAGCCACA 
211 TGTGGAGAAG GCTTTCTTCC TGCCCTCTAC AGAGCGAATT CGCAGGACTG CAGCTGCCAC GTGGATATAG 
281 GAGATGACAA TGAGAGCCAT GGGGGTACCT GCCATTATAA AACCCACAGC AAAAAGCAGC AGCTCATTGA 
351 GTTGGGTGCT GGAGCAGGAG AGCTGGAAGA GCTGTGGGAG GTCACAGTAG AAGTGATTGA TCACATTGGG 
421 GCCACAGAAG TTGAGCGTGG ACATGGCCAC AGTGTGGGTC AGTGCGTTGG TGAAAGCACA AGCCCAGGAC 
4 91 GCAGCCACCA ACATCCTCTG GACTGTCTGA CTCATGCGGG TGCTTGTAGG TGAGGGGCCC GGCAGATGGG 
561 CAGGAATCGG TCATAGGG 

SEQ 'l I TGGNNTTTTA TCNCCNTTGG AGCTCCNAAG CAGTGGTAAC AACGCAGAGT ACGCCCGTTG CGAAGCGTGT 
71 AGATTAGGGG GTTCAGTAGG GGAGTGATGA CAGTGTAGGT CACCGAGATC AGCTGGTCAT GTTCTCTGGT 
141 GTTCTCTGAC TTGGGCTTGA GGTAGGCAAT GGAGGCACAG CTGTAGTGGA CAATGACCAC AGTGAGGTGG 
211 GATGCACAGG TGGCAAAAGC CTTCTTCCGG CCCTCAACTG AAGTAATCTT GAGGATTGTA GAGATAATGA 
281 GAACATAAGA AATGAAAACC AGACCCATAG GTACAACAAG CACCAGCACA CTGATAATCA AAGTCAGGAT 
351 TTCATTGACA GTGGTGTCAA TGCAGGAGAG CTTCATCACA GGGCGGATGT CACAGAAGAA GTGGGGCACC 
421 TTTTCTAGCA CAGAAGGGTA ACCTGAATAC AGATGTCACT TGCGTTATTG CTACAATCAG CCCAATGCTG 
4 91 CAAGGCCCCC AGGACAAGTT GGATACGCAG CCTCTTGTTC ATAATAACCA TGTATCTCAA GGGGGTTGCA 
561 AGATGGCCAC ATAGCNGNTC ATATTCCN 

SEO ID NO • 4 

1 GTNGTTNTTA ACNCCATTGG AGCTCCAAAG CAGTGGTAAC AACGCAGAGT ACGCCCCCAA TGTATTTTTT 
71 TTTGAGAAAC TTGTCTTTCT TAGATTTTTG TTACATCTCT GTCACAATTC CAAAATCTAT TGTTAGTTCC 
141 TTGACTCATG ATACTTCCAT TTCTTTCTTT GGGTGTGCTC TGCAAGCCTT CTTTTTCATG GACTTGGCAA 
211 CTACGGAGGT AGCCATCCTT ACAGTGATGT CCTGTGACCG CTATATGGCC ATCTGCCGGC CTTTACATTA 
281 TGAGGTCATC ATAAACCAAG GTGTCTGTCT GAGGATGATG GCCATGTCGT GGCTCAGTGG GGTGATCTGT 
351 GGATTCATGC ATGTGATAGC AACATTCTCA TTACCATTCT GTGGGCGCAA TAGAATACGT CAATTTTTCT 
421 GTAATATTCC ACAACTNCTA AGCCTCTTAG ACCCCAAAGT AATTACCATT GAGATTGGAG TCATNGGNTT 
491 TTGGTACAAG TCTTGNGATA ATCCTCTTTG NTGNAATTAC TCTCTCCTAC ATGTNCATTT TTTTTTGNCA 
561 TCATGAGGGA TTCCTTCTAA AGG 

SEO I D NO ■ 5 

1 GNGGNTTNNT NCCNCCNTTG GACTCCAAAG CAGTGGTAAC AACGCAGAGT ACGCCCGTGT GTAAATGAAT 
71 GGGTTCAACA TGGGAGTCAT AACAGTGTAG GACAATGATA GCAGCTTCGT GCCCTCAGGT GAATTATTTG 
141 ATTTAGGCCG GAAGTAGGTG AGGCTTAATG ATATATAGAA AAGAGAGACA ACAAGGAGGT GTGAGGAACA 
211 TGTAGAAAAG GCTTTATTCT TCCCTTTAGC TGATGGGATC TTGAGGATGG CAGCAGCAAT GCGAGTATAG 
281 GAACACAAGA TCAGCAAGCA GGGGATCATG ACCACCAGAA TGGTTCCGAC GATGGCGTAG ATCTCAAACA 
351 GTGCTGTGTC TGCACAGACC AGCCTCAGCA CAGGTGGGCT GTCACAGAAG AAGTGGTTCA CCTTGTTGGT 
421 GCCACAGAAT GGAAAACTGA AGAGCCATGT GGTCTGCACA GTAGCTACAG GAAAGCCTGG GAACCAGGAG 
491 GCAGCAGCCA GTTTGGCACG AGTCCTTTGG TTCATGATGA CTGGGTAGTG CAAGGGACTN GCAGATNNNC 
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561 NCATTCGGTC ATATGNCATG GNAG 
;q. ID NO: 6 

1 CNTTGGAGCT CCAAAGCAGT GGTAACAACG 
71 AGGGTCGGGG GCACGACTGT GTAGAACGCA 
141 GCTTCACATA GGCAATGCTG CCAGATATCA 
211 AAATGTTTTT GATCTACCCT CCTTAGAAGG 
281 GTAATTACAA CAAAGGAGAT TATCACAAGA 
351 CTTTGGGGTC TAAGAGGCTT AGGAGTTTGT 
421 AGAATGGTAA TGAGAATGTT GCTATCACAT 
4 91 CATCATCCTC AGACAGACAC CTTGGTTTAT 
561 GCGGTCATAG GA 



CAGAGTACGC CCGCTCCGCA GAGAATAGAT GAAAGGGTTC 
GACAGGAAAA CATCCAGAAC GGGGGGAGAA TTTGAAATTG 
TAAAGAGTGT TACAACCACA AGATGTGGAA TGCAGGTAGA 
AATCCTCATG ATGACAGAAA AAATGTACAT GTAGGAGAGA 
CTTGTACCAA AAACCATGAC TCCAATCTCA ATGGTAATTA 
GGAATATTAC AGAAAAATTG ACGTATTCTA TTGCGCCCAC 
GCATGAATCC ACAGATCACC CCACTGAGCC ACGACATGGC 
GATGACCTCA TAATGTAAAG GCCGGCAGGA TGGCCATATA 



Q. ID NO:7 

1 GCAGTGGTAA CAACGCAGAG TACCGCCCCC 
71 GGTTTCTCCT CAGTGACTTG TCCCAAAATG 
141 AAGACTGTGT CTGCCAGCTT TTCTTCTTCC 
211 GGCCTATGAC CGCTTCACTG CCATCTGTTA 
281 GTGGCCCTGG CTGTGGGCAC ATGGCTGTTA 
351 CCTTGCCACA CTGTGGTCCC AATGAAGTGG 
421 CTGTGCTGAC ACATCCTTAG CCCAGAGGGT 
4 91 TGCTAAATCT TTTATCCTAC ACTAGAATCA 



TATGTACTTT TTCTTGGGAA ACTTGTCTGT GTTTGACATG 
CTGCTCTACC TTATGGGGCT GGGCCGACTC ATCTCCTACA 
ATTTCCTCGG GAGCATTGAG TGCTTCTTGT TTACGGTGAT 
TCCTCTGCGA TACACAGTCA TCATGAACCC AAGGATCTGT 
GGGTGCATTC ATTCCAGTAT CTTGACCTCC CTCACCTTCA 
ATCACTTCTT CTGTGACATT CCAGCACTGT TGCCCTTGGC 
GAGCTTCACC AACGTTGGCC TCATATCTCT GGCTGCTTTC 
CAAATATCTA TCTTAAGCAT TCGTACAAC 



SEQ. ID NO: 8 ^ 
1 GGAACAACGC AGAGTCGCCC CCGATGTACT 
71 TTCCACCACC ATTCCAAAAA TGCTGATGAA 
141 CTCATGCAGA TGTATTTTTT CATACTCTTT 
211 ACCGGTTTGT GGCCATCTGT CACCCCCTGC 
281 GGTTCTGGCA TCCTGGACCA TGAGTGCTCT 
351 TTCTGCACAG CCTTAGAAAT CCCCCACTTT 
421 ATAGCTTTCT TAATCACATG GTGATATATT 
4 91 CCTTTACTTC TTACTCTAAG ATAATTTCTT 



TGTTCTTCTC CAACCTGTCC TTTGCTGACA TTTGTGTTAC 
CATCCAGACA CAGAACAAAG TCATCACCTA CATAGCCTGC 
GCTGGATTTG AAAACTTCCT CCTGTCCGTG ATGGCCTATG 
ACTACATGGT CATTATGAAC CCTCACCTCT GTGGACTGCT 
GTATTCCTTG CTACAAATCT TAATGGTAGT ACGACTGTCC 
TTCTGTGAAC TTAATCAGGT CATCCAACTT GCTTGTTCTG 
TTACAGTTTG CGCTGCTGGG TGGAGGTCCC TGACTGGGAT 
CATACATGCA ATCTCANCAA GNTCAGGG 



Q. ID NO: 9 

1 GGGTTTTNAC CCNNTNGGAG CTCCNAGCAG 
71 TGAAGGGGTT GAGTGAGGGA GTCACCACTC 
141 GGAGGAGGGG GGCTGAAGGT ACATGCTGAT 
211 GAGCATGTCC CAAAGGCCTT TNTCCTTCCC 
281 CATAGGAAGC AAGAATTAAG CATANTGGGA 
351 GTTAGAACCC TTTTCACCAC AGGCAATCTT 
421 TTGAGACCAC ACAGTGGNAA TTTGTATTGT 
491 AACCACNACN GCGGNAACTA ANGATTCAGA 
561 AGAATGGCCA CATACCGNTC AAA 



TGGTAACAAC GCAGAGTACG CCCGTTTCGT AGGCTATAAA 
CATAGAAGAG GGCCATGAAC TTGGGTTGAT CCCTTGAGAT 
GGCTGGGCCA TAAAATAAGA AAACTACAAT AAGATGGGAG 
TTGGAAGATT TGATCTTAAA TACAGCACTT NCAATACTAG 
CAGCTAACAT AAAAATGCAT ACCACAGAGA GTGTGAGCTC 
TATCAGAACA GGAATCTCAC ACACCAAGTG GTCCAGCTTA 
GGCAGTGGCC CTCTGAGAAC GGCATAGATT ATACCAANTT 
CGCNCTGGAT TCATGATGAG GGTNTAGTGA AGAGGTTNTC 



Q. ID NO:10 

1 GCTGCTNCCA GCAGTGGTAA CAACGCANAG 
71 CTGGATGTCT GCTTCATCAC CACTACCATC 
141 TCTCCTTTGT ATGTTGCATG ACCCAGATGT 
211 GGCTTTCATG GCCTATGACC GNTATGNTGC 
281 AAGGTCTGTG TCAGGCTTGT GGGAACTGCC 
351 TTTCATTCCT AGAGCCCTTC CGCAGAGACA 
4 21 TTGGCCTCTT TTGTGGGGGA CCCTNANANT 
4 91 TCTNAGNCCC ATNGGTGCTN ACTGNTACTT 
561 CTCCTTCTN 



TACGCCCCCA ATGTATTTGT TCTTCGGCCA TCTGTCTCTC 
CCACAGATGT TGATCCACCT CGTGGTCAGG GACCACATTG 
ACTNTGTCTT CTGTGTTGGT GTGGCCGAGA GCATCCTCTT 
TATCTGCTAC CCACTTAACT ATGTCCCGAT CATAAGCCAT 
TGGNTCTTTG GGCTGATCAA TGGCATCTTT NTCGGGTATA 
ACCACATAGA AAGCTTCTTC TGCGAGGCCC CCATAGTGAT 
AGTCTGTGGG CAAATCTTTN GCCGATGCCA TCGTGGTAAT 
ACCTATNTGC ACATTCCTGT CCACCATCCT AGNNAAAGTC 



SEQ. ID NO: 11 
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1 GGNNTTTTAC CNCNATTGGA GCTGCAAAGC AC _A AC~ CGCCCCCTAT ^TCTTC 
71 TTGAGAAACT TGTCTTTCTT AGATTTTTGT JACATCTCTG TCACAATTCC acttggcaac 
141 TGACTCATGA TACTTCCATT TCTTTCTTTG GGTGTGCTCT GCAAGCCT1U TTTACATTAT 
211 TACGGAGGTA GCCATCCTTA CAGTGATGTC CTATGACCGC JATATGGCCA GTGATCTGTG 
281 GAGGTCATCA TAAGCCAAGG TGTCTGTCTG AGGAJGATGG CCATGTC^ ^ mTCTG 
351 GATTCATGCA TGTGATAGCA ACATTCTCAT ™CCATK. AGATTGGAGT CATGGTTTTT 

IS! SE SSSSJ SSS5S SEES Sc™«« ™» T 

561 GAGGATTCCT TCTAAGGAGG GG 

SEQ. ID NO: 12 Ba rrArTCGT AACAACGCAG AGTACGCCCT CTTGTCCTCG TGCCGATACA 

1 GGNTTTGACC ACGGAGCTCC AAGCAGTGGT AACAACGCAG TTCTTGCCCT CAGGTGAATT 

71 TGATGGGGTT CAACATGGGA GTCATAACAG TGTAGGACAA TGMAG^ AGACAACAAG GAGGTGTGAG 
141 ATTTGATTTA GGCCGGAAGT AGGTGAGGCT TAATGATAia GATGGCAGCA GCAATGTGAG 

211 GAACATGTAG AAAAGGCTTT ATTCTTCCCT "AGCTGATG gATCTTGAG ^ ^g^AGATCTC 

281 TATAGGAACA CAAGATCAGC AAGCAGGGGA TCATGACCAC CAG AGAAGAAGTG GTTCACCTTG 

351 AAAGAGTGCT GTGTCTGCAC AGACCAGCCT CAGNACAGGT J TACAGGAAAG CCTGGGAACC 

: 2 S ; ss is ss s™ 

561 ATGGCCACAT ANCGGTCNT 

SEQ. ID NO: 13 ^ ntl „. r TCTTAACAAC GCAGAGTACG CCCCCAATGT ATTTATTCTT 

1 GNNNTTNNNN CCACTGGAGC TCCAAAGCAG TGGTAACAAC G ™£ GCTGCTGTGG AACCTAAAAG 

71 GCTCACCTCT CCTTAGTTGA TATCTGTTTT ACCACCAGTA "GTCCCCCA JGGGCTCCAC 
141 GACCTGACAA AACAATCACA TTCCTGGGTT GTGTCATCCA «C GCAAACCTCT CCACTATACC 

211 TGAGTGTGTC CTCCTGGCTG TAATGGCTTT TGATCGCTAT GCTGCAG1 GGAAACACTC 
281 GCCGTAATGA ACCCTCAGCT GTGCCAGGCT CTGGCAGGGG TTGCGTGG^ TCTTCGTGAG 
351 TTATCCAGGG CACTGTCACC CTCTGGCTTC CTCGClbl TCAGCTCTTT GTTGCTTCAC 

JS SSSS ££££ SS SSS TATGGACATR TAGCCAAGGT 



561 TCAAGTCAGT CCAGCCT 



SEQ. ID NO: 14 rraarrarTr GTAACAACGC AGAGTACGCC CATACATGAT GGGGTTCAGT 

1 GGNNTNTNAC TCCATGGACT CCAAGCAGTG GTAACAAU£ * QGTGTTCTCT GACTTGGGCT 

71 AGGGGAGTGA TGACAGTGTA GGTCACCGAG ATCAGCTGGT CATGTTCTCT AGGTGGCAAA 
141 TGAGGTAGGC AATGGAGGCA CAGCTGTAGT GGACAATGAC CACAGTGA CATA AGAAATGA AA 

211 AGCCTTCTTC CGGCCCTCAA CTGAAGCAAT CTTGAGGATT GNAGAGATAA ^ ACAGTGGTGT 

281 ACCAGACCCA TAGGTACAAC AAGCACCAGC ACACTGATAA TCAAA ACCTTTCTAG CACAGAAGGG 

351 CAATGCAGGA GAGCTTCATC ACAGNGCGGA TGTCACAGAA £AAG TGCNG GCCCC CAGGACAAGT 

i£ SS ~cc anggggttgc NGATNNCCAC ATAGTNGCAT 

561 ANACCATTGC TGNGAGC 

SEQ. ID NO: 15 ^„, m ,. r rnrTKGTAAC AACGCAGAGT ACGCCCATTA CGAAAAGTGT 

1 GNCGNTNTTA ACNCCATTGG AGCTCCAAAG ^GTAAC AACGC CCTTTGTTGA GCAGTTTGGA 

71 AGATGAAGGG GTTCAAGAGG GGTGTGATGA TGCAGCTCAG gCGGAGG CAC TGTAAGATGC 

141 CTGAGCCTCT GACATACGAA TGTAGAGAAA JATCGAACTG CCATAGA GAGAACAGTG GCAAGAATGG 
211 GAGGCGCAAG TGGAAAACGC "TCCTTCGC JCAGCAGCTG TAGGGGC GAGAGCATGA AAGCCACCAG 
281 AGGCATAGGA AACTGAGGTC AGAGCCAGTG JGCCCAGTAA CACCAA GTGGTCAATA 
351 TTTCAGCAGG TGGGTGTCCC CACAAGAAAG CCTGAGCAAG GGCC GGACCCAGAG GAATNCANCT 

\l\ AGCCAGGAGG SESKS SSS "AT TNATTAGGGT CTCATAGCGG AGTTGTCGNC 
561 AGATTTGCNT GGTNACGATT CAN 



SEQ. ID NO: 16 
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1 GGNNTTTTAC CNCNATTGGA CTCCAAAGCA GTGGTAACAA CGCAGAGTAC GCCCCCTATG TATTTATTCT 
71 TGCTCACCTC TCCTTAGTTG ATATCTGTTT TACCACCAGT ATTGTCCCCC AGCTGCTGTG GAACCTAAAA 
141 ggSSSX AAACAATCAC ATTCCTGGGT TGTGTCATCC AGCTCTACAT CTCCCTGGCA TTGGGCTCCA 
li\ ptgSgt'gt' CCTCCTGGCT GTAATGGCTT TTGATCGCTG TGCTGCAGTT tgcaaacctc tccactatac 
28 cgcSSSg aScScagc TGTGCCAGGC TCTGGCAGGG GTTGCGTGGC tgagtggagt gggaaacact 
Hi SStcSgg gcactgtcac cctctggctt ccccgctgtg gacaccgatt gctccaacat ttcttcgtga 
42 gSaccc?cc atgattaagc ttgcatgtgt ggacatccat gataatgagg ttcagctctt tgttgcttca 

491 ctStCTTGC TCCTCTTGCC CTTAGTGCTA ATACTGCTGC CTATGGACAT ATAGCCAANG TGGCATAAAG 
561 GATCAAGTCA GTCCAGG 

SEQ 'i I gnnnot™tt CANTCCATTG GGCCCTCTAG atgcatgctc GAGCGGCCGC CAGTGTGATG GATATCTGCA 
7 Stcgccc TTATTCCGGA GGGTATACAT GAAGGGATTG gtaactagac GTAAACTCGA AGCCAAGAAC 

ill aSaJSctc ttagaaaaga gaattgaaac taaagagaaa gaactagcaa agaaggaaat attgaatata 

2i ca^gIgagag gagacagatg atggaacaag actctgaaag aggtggaagg gattgaatac aatcaaaagt 

111 a^gtgactg ctagttccaa gatggtggcg taggggcaag ctggctttgc ttacccccct ggcagaaaac 

35 caIaScaa^ tagcaccaag attatcacta gcaatatccc agaactcaca tataaggatg agacagttcc 

421 SgmJcCAG AGAAGATCAG AAGCACAAGT GGGAGAAGTC AGCTTTGGAT GCTACTTTGT TCTAAGGGAG 

9 SgSggg AGGATGATTG cagatgtata ttcaatgtta taaaacagcc cataaaacaa agattggaaa 
111 JS??£S? ttgcaaccag gagcaaatac tgggaaaggc gaattccagc cacttgcngc c 

SEQ ' i^nStnan tcantgccct ngggccctct agatgcatgc tcgagcggcc gccagtgtga tggatatctg 

7 ! CAGAATTCGC CCTTGTTGCG CAAGGTGTAA ATGAAAGGGT TTGCG.CAGGA GTAAATGAAG GGATTACGCA 
141 ggactaaaS aagggattac GCAGGAGTAA atgaagggat TACGCAGGAG TAAATGAAGG GATTACGCAG 
21 gaSSga mggattacg caggagtaaa tgaagggatt acgcaggagt aaatgaaggg ATTACGCAGG 

2 AGTAAATGAA GGGATTACGC AGGAGTAAAT GAAGGGATTA CGCAGGAGTA AATGAAGGGA TTACGCAGGA 
35 GTAAATGAAG GGATTACGCA GGAGCAAATA CATAGGAAGG GCGAATTCCA GCACACTGGC GGCCGTTACT 
421 AGTGGATCCG AGCTCGGTAC CAAGCTTGAT GCATAGCTTG AGTATTCTAA CGCGTCACCT AAATAGCTTG 
9 GcSSS? GGTCATAGCT GTTTCCTGTG TGAAATTGTT ATCCGCTCAC AATTCCACAC AACATACGAG 
561 CCCGGAAGCA TAAAGTGTAA AGNCTGGGGT GCCTAATGAG TGACTTACTC CATTA 

SEQ "i I ?nS™att CCATCCATTG TCOCTTCAGA TGCATGCTCG AGCGGCCGCC AGTGTGATGG ATATCTGCAG 
7 SSSSSS TCTTGGTTTT TGTGCTGATA GATCATGGGA TTCAGCATGG GGGTGACCAC AGTGTACATC 

14 SgaggSg Sgcacttga GTGTGAGTTG CGGGTGGCAG CAGAACTAAG GTACACCCCT AGGATTGCAC 
1 SSSSSaa ggagacIIct GAGAGGTGAG ATGCACAGGT ggaagatgcc ttgtacttcc cctgagctga 

28 Sag^gS tgtatggaan gaaattatnt tanaagtaag agtaaagnat nccagtcagg ggnancnttc 

35 IcScAGN TGCAANTTGT AAAAATTATA TTCAANCNAT NTGNATTTAA NGAAAANCCT TATCANGTAN 
421 ACACTGCNAA GNTNTGNATT NANCCCTNGN ANTTAANNTT TCNACAAGAA AATAANGTGC GTTNNAATCT 
ill SSSct CTNTCNCCAT TAANGTCNAN TCCNTCCNTA TCCCTTTTCN NATTTTGNAN TCNNGANTAC 
lei Scnatttc TNTNNTNNCT GACCTACTAA CCNATTNAGT TACNACAAGN CCNTTCNANT 

631 NCTCGCANGT TNTCCCTCTT NNCANNTNCC CNTTNTTNTC CCTNTTCCCC ATCTNC 

SEQ 'l I ?rA?TGGCCC TCTAGATGCA TGCTCGAGCG GCCGCCAGTG TGATGGATAT CTGCAGAATT CGCCCTTCCT 
7 StSc CTTTCCTGGT TCTCAAACTC TTCAGCTCTC TCTCTTTATG CTTTTTCTGG 

ill JgSgScat CCTCACAGTT AGTGGTAATG TGGCTATCTT GATGTTGGTG AGCACCTCCC ATCAGTTGCA 
21 JacccccSg SSJSttc TGAGCAACCT CTCCTTCCTG GAGATTTGGT ATACCACAGC AGCAGTGCCC 
28 SSSSS SScCTACT GGAGAGAAGT CAGACCATAT CATTTACAAG CTGTCTTTTG CAGATGTACT 
ll\ ™^„c ATTAGGCTGC ACAGAGTACT TCCTCCTGGC AGCCATGGCT TATGACCGCT GTCTTGCCAT 
421 ScACTACG GAGCCATCAT GAGTAGCCTG CTCTCAGCGC AACTGGCCTT GGGCTTCTGG 

491 GTGGNTGGGT TCGGGGGCAA TGCAGTGCCC ACAGGCCTTC AATCAAGTGG GCTGNTCCTT CTGGTGGCCC 
561 CCGGTGCCAA TCAACCACTT TNTTTTTGGG ACAATTGCAN CCCTGGAATT GGCCC 
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SEQ. ID NO:21 

1 GNNCTTANTT CAATCCCACC NANCCNTGCC 

71 AGAATTCGCC CTTCCTATGT ATTTACTCTT 

141 TTTATGCTTT TTCTGGTGAT GTACATCCTC 

211 CNTCCCATCA GNTGCATACC CCCATGTTNT 

281 CNCAAGCNGC ANNGCCCAAA GCTTTGCNCA 

351 TTTNTGGCTN CCTTNCCTCT TNCTTCNCTC 

4 21 CNTNNCNCCT NTAATNTTCC NCCTNTTCTN 

4 91 CCNTNCTNCT NACTTCANNC TCNGNCNNTN 

561 CTCCNTTTAT ATCGCCTCTT CTCNTNCTTC 

631 CGTNCNNTAT CTANNCTCCT NTTTCNGTCC 

701 TCGCATCGTN GCANCATCTC CTCCCATCTC 



GANGCATGCT CGNGCGGCCG CCAGTGTGAT GGATATCTGC 
ACTGGGCTTT CCTGGNTCTC AAACTCTTCA GCTCTCTCTC 
ACAGTTAGTG GTAATGTGGC TATCTTGATG NTGGTGAGCA 
TCTTTCTGAG CNACCTCTCC TTCCTGGAGA TTTGGTATNC 
TCTTATTGCN CAGANGCNNN CCNNTACANN NACNCTCCTG 
ANN TACT NCN TCTNCTNTAG TNTCTTTCTT CTCTNTCNCT 
NTTTCTNTTT TCCCTNCTCT GTTTCACCCC TACCTCTTAT 
NNNCNCCNNT AAATNTANGN NNANNNTNNN ATNTNCTCTT 
CNNTTCTCTC TCCTCANNCA TATCNANTNT NTTCTACTCT 
TNCTTCTCCT NTCATTTCTA TATTNCTTCT CANACANTNT 
CTGTNCNCTN TTCCN 



SEQ. ID NO:22 

1 GNNNTTAANT CATTCCCCNC TCNATGCATG 

71 CCCTTGTTTC GGAGGCAGTA GATGAATGGG 

141 TACAGGATGA GATATAAATG AAACAAATGC 

211 CCACCTGGAA GGGTAGGCTG TTTGTGAATA 

281 TAAAGAATGT GAACGATGTG CTCACAGAGG 

351 ACAAAGGACA CGGTGGCATG AGCGCATGTA 

421 GAGTGGAATA CAGTACAGGG CTTTGTAAAC 

4 91 TGAAAATGGG TTCTGAAGGC CAGAAGCCCA 

561 TCAGGACGTA ATTCTTGGCA GTTGCTAGAA 



CTCGAGCGGC CGCCAGNGTG ATGGATATCT GCAGAATTCG 
TTGATGGAAT CTGAGACAGT GCTCTAGAAT CTGTGTTTCA 
TAAATAATGA CACAAGGTAC CTTGCCGAGA GAGGAATCAT 
ATGTAGGGTG GGAGAGAAGG CTTTACTAAG GAGATGGGCT 
CCACAGAAGA GAAATTATAG CCAGGAGAAC AACCTGAAAG 
ACACAATGTA CTCAGGAAAT GGCTGGCATC CTGAGATATG 
TCAGCTTGGA GTCAGATCAC AGAAAGCCTT GACAAGGAAC 
TTCAAGATTC CCAAAGGGAA AAACACAAAT CAGCTTGGTT 
TTACATCAGA AAGGAGGTTC ACNT 



SEQ. ID NO:23 

1 GNNNTNANTC ANNCANTGGG CCCTCTAGAT 
71 ATTCGCCCTT CCTATGTATT TCCTCTTACT 
141 ATGCTTTTTC TGGTGATGTA CATCCCCACA 
211 CCCATCAGTT GCATACCCCC ATGTACTTCT 
281 AGCAGCAGTG CCCAAAGCAC TGGCCATCCT 
351 TTGCAGATGT ACTTTGTTAT CTCATTAGGC 
421 GCTGTCTTGC CATCTGCTAT CCTTTACACT 
4 91 CCTGGGCTCC TGGGTGNGGG GGTTCGTGGC 
561 NTCTGGGGGC CCCCGGGCCA TTNACCACTT 



GCATGCTCGA GCGGCCGCCA GTGTGATGGA TATCTGCAGA 
GGGCTTTCCT GGTTCTCAAA CTCTTCAGCT CTCTCTCTTT 
GTTAGTGGTA ATGTGGCTAT CTTGATGTTG GTGAGCACCT 
TTCTGAGCAA CCTCTCCTTC CTGGAGATTT GGTATACCAC 
ACTGGGGAGA AGTCAGACCA TATCATTTAC AAGCTGTCTT 
T GC AC AG AG T ACTTCCTCCT GGCAGCCATG GCTTATGACC 
ACGGAGCCAT CATGAGTAGC CTGCTCTCAG CGCAGCTGGC 
CATTGCAAGT GCCCACAAGC CCTAATCAGT GGCCCTGTCC 
TNTTCTGGGA CAATTGCACC CCTGGAATTG G 



SEQ. ID NO: 24 

1 TNNTTAANTC ATTCCNTTGN CCCTCNAGAT 
71 ATTCGCCCTT TCCTTGTTAC TGAGGGAGTA 
141 CTGTGTTTCA TACAGGATGA GATATAAATG 
211 GAGGAATCAT CCACCTGGAA GGGTAGGCTG 
281 GAGATGGGCT TAAAGAATGT GAACGATGTG 
351 AACCTGAAAG ACAAAGGACA CGGTGGCATA 
421 CTGAGATATG GAGTGGAATA CAGTACAGGG 
4 91 GACAAGGAAC TGAAAATGGG TTCTGAAGGC 
561 ACTTGTTTTC AGGACGTATT CTTGGGCAGT 

SEQ. ID NO:25 

1 GNNNNTTANT CCATGCCCCT CTAGATGCAT 
71 GCCCTTGTTT CGCAGCCTAT AAATGAAGGG 
141 ATACAGGATG AGATATAAAT GAAACAAATG 
211 TCCACCTGGA AGGGTAGGCT GTTTGTGAAT 
281 TTAAAGAATG TGAACGATGT GCTCACAGAG 
351 GACAAAGGAC ACCGGTGGCA TAAGCACATG 
4 21 TGGAGTGGAA TACAGTACCG GGGCTTTGTA 
4 91 AACTGAAAAT TGGGTTCTTG AAGGCCAGAA 



GCATGCTCGA GCGGCCGCCA GTGTGATGGA TATCTGCAGA 
GATTAGGGGA TTGATGGAAT CTGAGACAGT GCTCTAGAAT 
AAACAAATGC TAAATAATGA CACAAGGTAC CTTGCCGAGA 
TTTGTGAATA ATGTAGGGTG GGAGAGAAGG CTTTACTAAG 
CTCACAGAGG CCACAGAAGA GAAATTATAG CCAGGAGAAC 
AGCGCATGTA ACACAATGTA CTCAGGAAAT GGCTGGCATC 
CTTTGTAAAC TCAGCTTGGA GTCAGATCAC AGAAAGCCTT 
CAGAAGCCAT TCAAGATTCC CAAAGGGAAA AACACANATC 
TGCTAGAATT ACATCAGAAA GG 



GCTCGAGCGG CCGCCAGTGT GATGGATATC TGCAGAATTC 
GTTGATGGAA TCTGAGACAG TGCTCTAGAA TCTGTGTTTC 
CTAAATAATG ACACAAGGTA CCTTGCCGAG AGAGGAATCA 
AATGTAGGGT GGGAGAGAGG GCTTTACTAA GGAGATGGGC 
GCCACAGAAG AGAAATTATA GCCAGGAGAA CAACCTGAAA 
TAACACAATG TACTCAGGAA ATGGCTGGCA TCCTGAGGTA 
AACTCAGCTT GGAGTCAGAT CCAGAAAGCC CTTGACAAGG 
GCCATTCAAG GATTCCCCAA AGGGGAAAAA CACAAATCAA 
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561 GCTTGTTTTC AGGGACCGTT AATTCTGGGG CCAGGTTGCT TGAATTACCT TCANGAAAGG GAGGTTCACA 
631 CT 



Q. ID NO:26 

1 GNNCTTATTC ATCCCCCTCT AGATGCATGC TCGAGCGGCC GCCAGTGTGA TGGATATCTG CAGAATTCGC 
71 CCTTTCTTTG TTCCTCAGAG TGTAGATTAG GGGGTTGATG GGGTTGATGG AATCTGAGAC AGTGCTCTAG 
141 AATCTGTGTT TCATACAGGA TGAGATATAA ATGAAACAAA TGCTAAATAA TGACACAAGG TACCTTGCCG 
211 AGAGAGGAAT CATCCACCTG GAAGGGTAGG CTGTTTGTGA ATAATGTAGG GTGGGAGAGA AGGCTTTACT 
281 AAGGAGATGG GCTTAAAGAA TGTGAACGAT GTGCTCACAG AGGCCACAGA AGAGAAATTA TAGCCAGGAG 
351 AACAACCTGA AAGACAAAGG ACACGGTGGC ATAAGCGCAT GTAACACAAT GTACTCAGGA AATGGCTGNC 
4 21 ATNCTNAGAT ATGGAGNGNG AATACCAGTA CANGGCTTTN TANACTCANC TTGGAGTNCA GAATCACANA 
4 91 ANGCCTTGCA AGGAACTGAA AATGGGTTCT GAAAGGCCAG AAGCCNTTNA AGATTCCCAA AGGGAAAAAA 
561 CACAAATCAA GCTTTTTTNA AGNACNGTAA TTCNTGGNGC CAGTTGCTTA GAATTNCCAT CANAAANG 



Q. ID NO : 27 

! GGNNTAAGCC TTCCCCCTNC GATGCTGCTC GAGCGGCCGC CAGTGTGATG GATATCTGCA GAATTCGCCC 

71 TTCCCATGTA TTTCCTCTTA CTGGGCTTTC CTGGTTCTCA AACTCTTCAG CTCTCTCTCT TTATGCTTTT 

141 TCTGGTGATG TACATCCTCA CAGTTAGNGG TAATGGGGCT ATCTTGATGN TGGTGAGCAC CNCCCATCAG 

211 TTGCATACCC CCATGTACTT CTTTCTGAGC AACCNNTCCN TCCTGGAGAN TTTGGNATAC CACACGCAAN 

281 NAGNGNCCNA AGGCACTTGG NCNTNCTACA GGNGGAGAAG GCTTGACCAT ANNATTTTAC CATGCCTNGC 

351 CTTANGNCAN ACCCNNCTTN TNCCTNTTNT TCCNCTNNNN GGTNNNTCAN CCGCANNCTT NNATCNNNTG 

421 NANCTTCATN GAATATGGNN TNNGTNTNTC TTGAGAGCCT CNNGATCNNA TTTTTTCCAN CNNCTAAAGN 

4 91 GGNGCTTNTC TCTCTNNNAT CTAGCTTNNT GGNTCTCTTT TNTNTNCTNA CCCGTGNTNT CCTATNTGNT 

561 GTCTCTTCCT ACNNNCTGCN NTTATTNTAN ATCANNTCTN NCNTTGCTCT CNTNTACNAC ATNATCATNC 

631 TCNCTCCCCN CTNTCNCTCT CTATNNCNTA CCATCNCTCT CTTCTCATTC ANCTCTTTNT CATTGNTTGT 

701 TCANTTANNC ACTCTCCNTC NCATCTTCTA TNCACTANNT TTNTTNTTTT NCTCTCTANT TCTNNTTCCA 
7 71 NTGTNCACTC CNNTCTTNNC NNTTNCCCTA NCG 



Q. ID NO: 28 

1 GTNNTTNANN NCATTGCCCC TCTNGATGCA TGCTCGAGCG GCCGCCAGTG TGATGGATAT CTGCAGAATT 
71 CGCCCTTCCT ATGTACTTCC TCTTACCGGG CTTTCCTGGT TCTCAAACTC TTCAGCTCTC TCTCTTTATG 
141 CTTTTTCTGG TGATGTACAT CCTCACGGTT AGTGGTAATG TGGCTATCTT GATGTTGGTG AGCACCTCCC 
211 ATCAGTTGCA TACCCCCATG TACTTCTTTC TGAGCAACCT CTCCTTCCTG GAGATTTGGT ATACCACAGC 
281 AGCAGTGCCC AAAGCACTGG CCATCCTACT GGGGAGAAGT CAGACCATAT CATTTACAAG CTGTCTTTTG 
351 CAGATGTACT TTGTTTTCTC ATTAGGCTGC ACAGAGTACT TCCTCCTGGC AGCCATGGCT TATGACCGCT 
421 GTCTTGCCAT CTGCTATCCT TTACACTACG GAGCCATCAT GAGTAGCCTG CTCTCAGCGC AGCTGGCCCT 
491 GGGCTTCTGG GTGGGTGGGT TTCGGGGCCA TTGCAAGTGC CCACAGCCCT TATCAAGTGG CCTGTCCTTC 
561 TGNGGCCCCC GGGCCCATCA ACCACTTTTT TCTGGGGACA ATTGCACCCT GGAATGGCCC 



'l GTNNNTNNNT CCATNCCATT GGGCCCTCTA GATGCATGCT CGAGCGGCCG CCAGTGTGAT GGATATCTGC 
71 AGAATTCGCC CTTTCATGGT TCCGGAAACA GTAAATTATG GGGTTCAGTC ATGGTAACAG GAGGAGGCTG 
141 AGTGTATGGG CATGGATGGG GGCTGTGAAT GTGGCGGGAG CTCATGGATG TGCTCTTCTG AGTGCTTCAC 
211 GTTTCTGAGT GAAATAAGAA GCAAGGTCAT CACCGAGAGG GAGGAGACAG GCTCGGGTGA GTTTAGTGGA 
281 TATGAATCCA AGAGAGACCA TTCAACTTAG TTGTCTATTT TTTTTTTCTC CAGTTATAGT CACTTGCATG 
351 AATGTAGATG TGGAGTACTT GATCATAAGA TCCATTTTAT GGCAGAAGAC ATTATTTTTC TGAGCCTTCT 
421 GCTGTCAGTT TCTAAATAAG CAGGCCAGCC GGGCTGTGCA CCTAAATGTC TGTCTGGGAG GAGGAGGCTG 
4 91 AGAAGTCTTG CAGTCTGCAG GACACCCGAG GAATCGTATT GTGGGAACCG TCCCCGAGAA CCACACGAGC 
561 CGTGCTNCTC AGTNCTGACT GGAANAATGA AATTGNAAGC CAAGTNGTTC NNGGANCNNT 



1 GNNNTTNANN CCATTGCGCC CTCTAGATGC ATGCTCGAGC GGCCGCCAGT GTGATGGATA TCTGCAGAAT 

71 TCGCCCTTCC TATGTATTTC TCTTCCTAAC GATTGGAATG CCTGGGATTA GGCAGATGAT TTTCTTTTTC 

141 CCCCATACCC CTCTATTATT TAGGTGATTG AGTTTAAATC CCTTTATCTA CACCCTTCGG AACAAGGGCG 

211 AATTCCAGCA CACTGGCGGC CGTTACTAGT GGATCCGAGC TCGGTACCAA GCTTGATGCA TAGCTTGAGT 
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981 ATTCTAACGC GTCACCTAAA TAGCTTGGCG TAATCATGGT CATAGCTGTT TCCTGTGTGA AATTGTTATC 
\l\ tcSScaac ATACGAGCCG GAAGCATAAA GTGTAAAGCC tggggtgcct aatgagtgag 

l 2 \ SSSSE ESSSSr tgcgctcact gcccgctttc cagtcgggaa acctgtcgtg ccagctgcat 
9i gcSacgcgc ggggagaggc ggtttgcgta ttgggcgctc ttccgctttc tcgctcactg 

561 ACTCGCTGGG CTTCGGTCGN TCGGCTGCGG CGAGCGGGAT CAGCTCACTC AAAAGG 



SEQ. ID N0:31 

1 GNNNNNNNNT CANGCCATTG GGCCCTCTAG 
71 GAATTCGCCC TTCCTATGTA TTTCTCTTCA 
141 CTCCCTCCTC CCATGTATTT TCTCTTCAAT 
211 TTTTCTCTTC CAATCCCTTC ATATACACGT 
281 CTAGTGGATC CGAGCTCGGT ACCAAGCTTG 
351 TGGCGTAATC ATGGTCATAG CTGTTTCCTG 
4 21 AGCCGGAAGC ATAAAGTGTA AAGCCTGGGG 
4 91 CACTGGCCGC TTTCCANGTC GGGAAACCTG 
561 GAGGCGGTTT GCGTATTGGG CGCTNTTTCG 



ATGCATGCTC GAGCGGCCGC CAGTGTGATG GATATCTGCA 
CTTTCTCCGA CATCACTCAC AGCCACCCCA CCCTCAGCCT 
CTCTCCTTCT TTGATATCCT GAACTTTCTG TAGCTCTTTA 
TTCGTAACAA GGGCGAATTC CAGCACACTG GCGGCCGTTA 
ATGCATAGCT TGAGTATTCT AACGCGTCAC CTAAATAGCT 
TGTGAAATTG TTATCCGCTC ACAATTCCAC ACAACATACG 
TGCCTAATGA GTGAGCTAAC TCACATTAAT TGCGTGCGCT 
TCGGCCAGCT GCATTAAATG AATCGGCCAA CGCNCCGGGA 
TTCTTCGNTN ACTGATCGNT GG 



SEQ. ID NO:32 

1 GNNNNNNNNT TCATNCCATT GGGCCCTCTA 
71 AGAATTCGCC CTTGTTGCTT AGAGTGTAAA 
141 ATAGGAAGGG CGAATTCCAG CACACTGGCG 
211 CATAGCTTGA GTATTCTAAC GCGTCACCTA 
281 GAAATTGTTA TCCGCTCACA ATTCCACACA 
351 CTAATGAGTG AGCTAACTCA CATTAATTGC 
4 21 TGCCAGCTGC ATTAATGAAT CGGCCAACGC 
4 91 CCTCGCTCAC TGACTCGCTG CGCTCGGTCG 
561 TAATACGGTT ATCCACAGAA TCAGGGGGAT 



GATGCATGCT CGAGCGGCCG CCAGTGTGAT GGATATCTGC 
TAAAAGGGTT AACATTGGCT TAGAGGTGAA GAGTAAATAC 
GCCGTTACTA GTGGATCCGA GCTCGGTACC AAGCTTGATG 
AATAGCTTGG CGTAATCATG GTCATAGCTG TTTCCTGTGT 
ACATACGAGC CGGAAGCATA AAGTGTAAAG CCTGGGGTGC 
GTTGCGCTCA CTGCCCGCTT TCCAGTCGGG AAACCTGTCG 
GCGGGGAGAG GCGGTTTGCG TATTGGGCGC TCTTCCGCTT 
NTCGGCTGCG GCGAGCGGTA TCAAGCTCAC TCAAAGGCGG 
ACGCANGAAA GAACATGTGA GCAAAT 



SEQ. ID NO: 33 

1 GNTNTNANTC ATGCCCCCNC CGATGCNTGC 
71 CCTTGTTGCG GAGCGAATAT ATGAAGGGGT 
141 CACTGGCGGC CGTTACTAGT GGATCCGAGC 
211 GTCACCTAAA TAGCTTGGCG TAATCATGGT 
281 TCCACACAAC ATACGAGCCG GAAGCATAAA 
351 TTAATTGCGT TGCGCTCACT GCCCGCTTTC 
421 GCCAACGCGC CGGGGAGAGG CGGTTTGCGT 
4 91 CGCTCGGTCC GTTCGGCTGC GGCGAGCGGT 
561 AATCAGGGGG ATAACGCNGG AAAGAACATG 



NCGAGCGGCC GCCAGTGTGA TGGATATCTG CAGAATTCGC 
TAAGGGAAGA GAAAATACAT AGGAAGGGCG AATTCCAGCA 
TCGGTACCAA GCTTGATGCA TAGCTTGAGT ATTCTAACGC 
CATAGCTGTT TCCTGTGTGA AATTGTTATC CGCTCACAAT 
GTGTAAAGCC TGGGGTGCCT AATGAGTGAG CTAACTCACA 
CAGTCGGGAA ACCTGTCGTG CCAGCTGCAT TAATGAATCG 
ATTGGGCGCT CTTNCGCTTC CTCGCTCACT GACTCGCTTG 
ATCAANTCAC TCAAAAGGCG GGAATACGGG TTTNCACAGA 
TGAGCCANAN GGCAGCAAAA GGGCNAGGAA T 



SEQ. ID NO: 34 

1 GNNNNNNNNT CANNCCATTG GGCCCTCTAG 
71 GAATTCGCCC TTGTTCCGAA GGCTATAGAT 
141 AGAGAAAATA CATGGGAAGG GCGAATTCCA 
211 CAAGCTTGAT GCATAGCTTG AGTATTCTAA 
281 GTTTCCTGTG TGAAATTGTT ATCCGCTCAC 
351 GCCTGGGGTG CCTAATGAGT GAGCTAACTC 
421 GAAACCTGTC GTGCCAGCTG CATTAATGAA 
4 91 CTCTTCCGCT TCCTCGCTCA CTGACTCGCT 
561 CAAAGGCGGT AATACGGGTA TCCACAGAAT 



ATGCATGCTC GAGCGGCCGC CAGTGTGATG GATATCTGCA 
GAAGGGGTTT TAGGTTTTTA GGAACACAGG CTAAGGGGGA 
GCACACTGGC GGCCGTTACT AGTGGATCCG AGCTCGGTAC 
CGCGTCACCT AAATAGCTTG GCGTAATCAT GGTCATAGCT 
AATTCCACAC AACATACGAG CCGGAAGCAT AAAGTGTAAA 
ACATTAATTG CGTTGCGCTC ACTGCCCGCT TTCCAGTCGG 
TCGGCCAACG CGCGGGGAGA GGCGGTTTGC GTATTGGGCG 
GCGCTCGGTC GTCGGCTGCG GCGAGCGGTA TCAGCTCACT 
CANGGGATAA CGCAGGAAAA GACA 



SEQ. ID NO: 35 

1 GGNNTTNANT CATTGCCCCG CTNGATGCAT 
71 GCCCTTCCGA TGTATTTTCT TCTACGTTAA 
141 CTGTAATGCT GACAAGATGA AAGAGCCAAA 
211 GGATGTATAT ATATCTATTC AATATATGTA 



GCTCGAGCGG CCGCCAGTGT GATGGATATC TGCAGAATTC 
GGTATTTTAA ATTGTTACTA ATGCATAAGG GCAACACATT 
AGTAATTAAT GATGCTGTTA CCTCACAAAT ATGTATGTGT 
ACTATACATA TGTCTGTTTC TAATTGAAAA CACCAGGTAA 
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281 TTATCATCTG TAGAAACCCT AGTGTCTCAG ATAAGTTGGC TAGTTTTTTG TTTCACATAA AGGAACAAAC 
351 ATTTATAGAT TTATATGTAT £TAAAAATG GTAAAAATTG ^GGGTGCA 

; 2 9 ; sssss 'ssssssz « ««««« a*™*™ 

III Sagactgaa gSaggaaaa tcacttgacc caggaagcng aggttgcagg ngag 



SEQ - 1 I ^Sga N tcaattc N » N gs™ ™gg ccgocagt- OA— TGCAGAATTC 

71 GCCCTTCCTA TGTATTTCCT JCTAGCCAAC CTCCCACTCA TTGATCTGT s TTCAQATATT 

141 CCAAGATGAT TACTGACTTT TTCAGCCAGC GCAAAGTCAT GCTTTGACAG ATATATAGCA 

211 TCTCCTTCAC TTCTTTGGTG GGAGTGAGAT GGTGATCCTC JTAGCCATGG GCTGTCGCAT 
281 ATATGCAAAC CCCTACACTA CACTACAATT JTGTGTGGCA JCGCA TTACCCTTCT GTGGTCCCAA 

351 GGGGAATTGG CTTTCTCCAT TCGGTCAGCC JJTTOGCCTT ^GTG GTACAGATAC CTACAGGCTA 

S! ™gg !SS SSSSSS SSSSS SScttttgt CTTCTAATCA TCTCATACAC 

561 ATGACCATCC AGCATTGCCC TTTAGATAAG TCGTNCAAAG G 

™VS-2£«C C«CKK CTAGATGCAT GC™ GATGGATATC TGCAGAATTC 

71 GCCCTTCCCA TGTATTTGCT TCTCAGCAAC JTGTCCTTCT CTGACC GACTGCCTGA CCCAAATGTA 

141 CCAAGTTGTT ACAG AACAT G CAGAACCAGG ^CATCCM CCCCTA CCTATGACCG CTATGTGGCC 

211 CTTCTTCCTG TTATTTGGAG ACCTGGAGAA CTTCCTCCTT GTGGCCATGG GCGCTGTCCT 
281 ATCTGCTTCC CCCTGCACTA CACCGCCATC JTGAGCCCCA TGCTCTGTCT 

351 GGGTGCTGAC CACCTTCCAT GCCATGTTAC AtALl nArrArrcCT TCTCTGACAC TCGAGTTAAT 

: 2 9 ; ssss S55SS ™ T T sees ss; 

III SaSSgT 5SS TCAAGGTCCC TTTTNTAANG GTATCTGCAA GGCCCT 

SEQ. ID NO: 38 ^ MBMr „ N NrNrccCT CT AGATGCATGC TCGAGCGGCC GCCAGTGTGA TGGATATCTG 

1 NGNNNNTTNA NTCNANGCCN NGNGCCCTCT ^TW-au. cttatg GACC TGATGT ACATCTCCAC 

71 CAGAATTCGC CCTTCCAATG TATTTACTTC JCAGCCAGCT CTCCCTTATG £ ATGTGGTGTG 
141 CACCGTCCCC AAGATGGCGT ^CAACTTCCT GTCCGGCCAG ^GCATCT C TACGACCGTT 
211 CAAAGCTTCT TCTTCCTGAC CATGGCGTGT ^TGAAGGCT TACTCCTGAC ^ AGATGATTGG 
281 ATTTGGCCAT CTGCCACTCT "CTATTATC CTATCCGCAT ^TAAAATG TCC CTACTGC 
351 AGGCTCTTGG ACACTGGGGT CCATCAACTC CTTGGCACAC JCAGTCTT CAGATACTTG 
421 AGGTCTAGGG CTATTGACCA TTTCTTCTGC JATGTCCCAG CCATGTTOC GCATCACTTC 

\l\ SSSS SSSS TTGCTGGCTA SS TCAAAGGAGG GGAGG 

anktcakhgc cnntgskngc t „t gctcgagcgg CCGCCAGTGT GATGG g ATATC 

71 TGCAGAATTC GCCCTTCCAA TGTATTTTCT TCTCAGCAGG JJAGATATTT ^TCC CTGGCAGCCA 
141 TGACCGCTAT GTAGCCATCT GCTGTCCCCT GAACTACGAG ^CTGCACAGA ^ GCCTGC TCTC 
211 TGGCTTATGA CCGCTGTCTT GCCATCTGCT ATCCTTTACA CTACGGAGCC ATCATGAGTA ^ 
281 AGCGCAGCTG GCCCTGGGCT CCTGGGTCTG TGGTTTCGTG GCCATTGCAb ATTGCCCTGG 
351 GGCCTGTCCT TCTGTGGCCC CCGTGCCATC ^CACTTCT TCTGTGACAT * TGAGTTCATG 

\l\ £SS£ "at SSSS SSSS tcccctctgc AGTGGCCGGA 

561 SE5SS CTCCCGTGCT CCTCGCATCT NAACGNGGTG CTCATTTGGT ATGGG 

~ SS SSSS S5S5S SSSS SSSS = SESS 
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S EE SSii iEi » s 1= Hi 

si =E EE == S i=i = = 

561 ATTCTCAGGA TCCCTTCTGC AAGNGG 



SEQ. ID NO:41 

1 ATGGNNNNNN NNTTTNNNAA ANTTTTNCCC 
71 TTGGGCCCTT CCCGGAAGGC CCGGGGGCNC 
141 AATTCCTTTG GGCCAAGGNA AAAATTTTCC 
211 ANGGGGGGGT TGATTAATTA ATCGGGAAGN 
281 CCAACCCAAC AAGGTTGGTT TTCCANGGGA 
351 CATTCTTTTG GATTAAGGAA NGGTGCCGGG 
421 TGAGAACCAC GGTGAGATGC CGAGGAGCAC 
4 91 CCTGAGGATG GTGCTTGATG ATGTACACAT 
561 CAACAGCNAT CACAAAGGCC ACAAAGCTCT 
631 TGCAATGTCA CAAGAAAGAA AGTGGTTGAT 
701 GGCTTGTGGG CACTGCAATG GCCACGAAAC 
771 AGCAAGGCTA CTCATGAATG GCTTCCGTAG 
841 AAGCCATGGC TTGCCNG 



ANTTTGGGCC GNCCCCCCCT TCTTTAAGGN AATGGGCCCA 
CCGGCCCCAA AGGTTTGGGT TGGGAAATGG GGGGAATTTA 
NGCCCCCCTT TTTTTCCCCT TTTGGTTTTT ANCCGGGGGA 
TNGGGGGGAA NTTTTTTAAA AAAAACCTTG GGGGAAGGTT 
CCGTTGGGAC CAGGCTTTTN GAATCAAGAA TCCCAAAGGG 
ACCGGTGAAA GGGAAAAAAC TGGTGGACCC CATACCAAAA 
GTGGAGAAAG GCTTTGCTTC CGGCCACTGG CAGAGGGGAT 
AGGGAGACAA GGGTGATGAG GCATGAACTC AGGATAACCA 
ACTGCCTGTG TGTTGGGTGC AGGCCAGGGC AATCCAGGGG 
GGCACGGGNG GGCCACAGAA GGACAGGCCA CTTGATGAAG 
CACCAGACCC AGGAACCCAN GGCCAAGCTT GCGCCTGAAG 
TNGTAAAGGA TAGCAAGATG GCAAAGGCAA GCCGGTCATN 



SEQ. ID NO:42 

1 GNNNTTANNN CATTGCGCCC TCTAGATGCA 
71 TCGCCCTTGT TGCGCAAGGA GTAGATGAAC 
141 ACACTGGGCA CAAGTCCATG AGTAATCAAG 
211 ATAGAATCCC ATTACCCAGA GATAGACTGG 
281 CTCTCTATGC ATATGTATAC ACAAGCAATT 
351 TGCAACGAAT GCAATGTGGA GAATGAAGGC 
421 AGAGTGCAAC TGTCACTACA GCTAAAAGGA 
491 GTTTAAGACA CGCTGAGTCT CAAATGCCAT 
561 CAAAGCAGGT G GAT GAG AAA TCACATTTCA 



TGCTCGAGCG GCNCGCCAGT GTGATGGATA TCTGCAGAAT 
GGATTCAGGG CAAGGGAGTG CTGAGGAGAT AGACGGGTAT 
GCCTGTTATT TAAAAAAAAA AAAAAAAAAG CTTGAACAAT 
ATGGTGAATT AAACTTTCTG GTGAATTTCT TTCCAGATAT 
TTTGGAAGAA AAGATACTTT ATAAGGATAA GCCTGAAAAC 
AAGATGTGGC GAAGAAGGGC ACCACAATCT GGTGGCTGAG 
GAGCTGGAGA AGCTGGTGAG GACAGTAAGA GATGAATCTG 
GGCTCCCCTA GGTTGCCTCT TCAGATGTAA ATCTTAAGCT 
TAGTCCCTGC ACAGACGGCT NTNTTGAGCT 



SEQ. ID NO:43 

1 GNNNNTTAAN TCATTGCCCC GNNNGANGCA 
71 CGCCCTTCCC ATGTATTTGC TTCTCAGCAA 
141 CCCAAGTTGT TACAGAACAT GCAGAACCAG 
211 ACTTCTTCCT GTTATTTGGA GACCTGGAGA 
281 CATCTGCTTC CCCCTGCACT ACACCGCCAT 
351 TGGGTGCTGA CCACCTTCCA TGCCATGTTA 
421 ATGTGATCCC CCACTTTTTC TGTGATATGT 
491 TGAATGGGTG ATATTTATCA TGGGAGGGCT 
561 AAGAAATGCT CCTCATCCTC AAGGCCCTTC 



TGCTCGAGCG GCCGCCAGTG TGATGGATAT CTGCAGAATT 
CTTGTCCTTC TCTGACCTCT GCTTCTCTTC CGTGACCATT 
GACCCATCCA TCCCCTATGC GGACTGCCTG ACCCAAATGT 
GCTTCCTCCT TGTGGCCATG GCCTATGACC GCTATGTGGC 
CATGAGCCCC ATGCTCTGTC TCGCCCTGGT GGCGCTGTCC 
CACACTTTAC TCATGGCCAG GTTGTGTTTT TGTGCAGACA 
CTGCTCTGCT GAAGCTGGCC TTCTCTGACA CTCGAGTTAA 
CATTCTTGCA TCCATTCCTA CTCATCCTTG GGTCCTATGC 
TNTAAGGGTA TCTGCAAG 



SEQ. ID NO: 44 

1 GNNNNTNANT CNTGCCCTGN CCCNCNGCNC 
71 TTACTAAGAG TATAGATGAA CGGATTCAGG 
141 ACAAGTCCAT GAGTAATCAA GGCCTGTTAT 
211 TACCCAGAGA TAGACTGGAT GGTGAATTAA 
281 GTGTATACAC AAGCAATTTT TGGAAGAAAA 
351 AATGTGGAGA ATGAAGGCAA GATGTGGCGA 
4 21 TCACTACAGC TAAAAGGAGA GCTGGAGAAG 
4 91 CTGAGTCTCA GATGCCATGG CTTCCCTAGG 
561 GATGAGAAAT ACACATTTNA TAGTCACCTG 



NNGCGCCGCG GCGGATGGAT ATCTGCAGAA TTCGCCCTTG 
GCAAGGGAGT GCTGAGGAGA TAGACGGGTA TACACTGGGC 
TTAAAAAAAA AAAAAAAGCT TGAACAATAT AGAATCCCAT 
ACTTTCTGGT GAATTTCTTT CCAGATATCT CTCTATGCAT 
GATACTTTAT AAGGATAAGC CTGAAAACTG CAACGAATGC 
AGAAGGGCAC CACAATCTGG TGGCTGAGAG AGTGCAACTG 
CTGGTGAGGA CAGTAAGAGA TGAATCTGGN TTAAGACACG 
TTGCCTCTTN CAGATGTAAA TCTTAAGCTC AAAGCANGTG 
CACAGACGGT TTTTTGAT 
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SEQ. ID NO:45 

1 CATGCCCCGT CCCNCNAGNT NCNNGCNCCG 
71 ATTTACTTCT CCAACTTCTC CTTCCCATCT 
141 GGCGAATTCC AGCACACTGG CGGCCGTTAC 
211 GAGTATTCTA ACGCGTCACC TAAATAGCTT 
281 TATCCGCTCA CAATTCCACA CAACATACGA 
351 TGAGCTAACT CACATTAATT GCGTTGCGCT 
421 GCATTAATGA ATCGGCCAAC GCGCGGGGAG 
4 91 CTGACTCGCT GCGCTCGGTC GTTCGGCTGN 
561 ATCCACAAGA ATCAGGGGGA TAACGCAAGA 



CGGCCGCNAN GGATATCTGN ANAATTCGCC CTTCCTATGT 
CTATCATTAG AACCCATTCA TATACACCCT ACGAAACAAG 
TAGTGGATCC GAGCTCGGTA CCAAGCTTGA TGCATAGCTT 
GGCGTAATCA TGGTCATAGC TGTTTCCTGT GTGAAATTGT 
GCCGGAAGCA TAAAGTGTAA AGCCTGGGGT GCCTAATGAG 
CACTGCCCGC TTTCCAGTCG GGAAACCTGT CGTGCCAGCT 
AGGCGGTTTG CGTATTGGGC GCTCTTCCGC TTCTCGCTCA 
GGCGAGCGGT ATCAGCTCAC TCAAAGGCGG NAATACGGTT 
AAAGACATGT GA 



SEQ. ID NO:46 

1 GNNNTNATTN ATTGCATTGG GCCCTCTAGA 
71 AATTCGCCCT TAGTGAGTAG ATGAAAGGGT 
141 AGTGTCCTTT AGAGTAGAAC TATTAGCTGA 
211 GACACCACAG ATAGGTGGGA GCCACAAGTA 
281 GGATGGAGGA GACAATTCTT GCATAGGACC 
351 CATGATAAAC ATCACCCATT CATTAACTCG 
4 21 CAGAAAAGGT GGGGGATCAC ATTGTCTGCA 
4 91 CATGGAAGGT GGTCAGCACC CAGGACAGCG 
561 GTAGTGCAGG GGGAAGCAGA TGGCCACATA 



TGCATGCTCG AGCGGCCGCC AGTGTGATGG ATATCTGCAG 
TCAGCATGGG GGTCACCACA GTGTACATCA TAGCCATGAC 
TGAGCATAAG TAGAGACCAA TAACGGTTCC ATAGAACAGT 
GAGAAGGCCT TGCAGACACC CTTAGAAGAA GGGACCTTGA 
CAAGGATGAG TAGGAATGGG ATGACAAGAA TGAGCCCTCC 
AGTGTCAGAG AAGGCCAGCT TCAGCAGAGC AG AC AT AT C A 
CAAAAACACA ACCTGGCCAT GAGTAAAGTG TGTAACATGG 
CCACCAGGGC GAGACAGAGC ATGGGGCTCA TGAGGGCGGT 
GCGGTCATAG GCCATGGCCA CAAGGAGGAA 



SEQ Vpna?gggccc TCTAGATGCA TGCTCGAGCG GCCGCCAGTG TGATGGATAT CTGCAGAATT cgcccttcca 
561 GTCTTCTTCA TNCTCAANGG CCCTTTCTTC TAANGGTATC TTGCAAG 

s " ™s = = sssss = esse = 
,s ss ss. I= c J,™™™ ?s= ttc SS 

is EE iii ilii sss = ~ ~ 

il\ rrAGCTTACT CACAATAATT GGGGTGNGCC CACTGGCCCC TTTTCAGGCG GGAAAACCTN GCGGGGCCAG 

« iii Ss ssss = ss s— ™- 

s 7™ sss ssss = EE EE i= 
si ™ EE EE EE EE isss =ss 

'I SSSSS SSaTC CCGCTCACAA TTCACACAAC ATACGAGCCC GGGAGCATAA AGTGTAAAGC 
,,7 JLprrr rTGC C TAATGAGTGG AGCTTACTTA CATTAAATTG CGTTGCGCTC ACTGGCCGCT TTTCCAAult. 
491 GGGAAACCTG SSSSS TTCANTAATG AATCGGCCAA CGCCGCGGGG AGAGGCGGGT TGCGTATTGG 
561 GCGCTCTTCC GCTTCTTNGT TNACTGACTT CGG 
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SEQ. ID NO: 50 

1 GNNNTTTAAC NCCGGNGCTN CNAGCAGTGG 
71 TCTCAAGTCT TCCTCTTCTC CAAAGATTTT 
141 ATTTCAATAA CTGTATTCTC TTCATCATTT 
211 TTCCAATTTT ATCCTTCTTG TTTGCTGATT 
281 CCTGCCACCC TTTTCATATC CTCAATTCCC 
351 ATTCATAACA ATAATTATGT TTTTTTTTGC 
421 TCATACTTTT CTGGAAACAC TAATTCTGGT 
4 91 GCTGAAGATG ATGGCTAGAC AAATACTCAC 
561 TTACAAACCC CTTCATTTAT TCTCCGCAAC 



AACAACGCAG AGTACGCCCC CGATGTACTT TCTTTTTCAG 
GTCTTTTCTA CTACCTGAGC TACCAAATCC CTTGTCATCA 
CAACTTCAAA CGTGTCATCT CAGAACAAGC TTCATGTTAC 
CCAAGAATTC CAGTCCCATC TAGGCCCGCA ATGCATTGTT 
TTGTATCATC ACTTTCCTTT TATATAGCAC AGATTCCATG 
ATGTGCTCTT AATTTCCTTT CTTGCTCCTA TTATCTTCTA 
GAAATATACT CTTTGTGGAC TTTGCACTTA TGCTCAGTCA 
AATCATGCTG ACTGGCCCAA TTTATAGTCA TGACCACCGA 
AGGGGCGTCT TCTGCGCTTG AGCGTCCGGT GGGG 



SEQ. ID NO: 51 

1 GCAGTGGTAA CAACGCAGAG TACGCCCGTT 
71 TCACAGTACT GGAGCACACA GCATGTGAAT 
141 CAGTGCAGCA GGGCGAGGCA CTTGTCTTAT 
211 TTAATCCCAT TTTCACAGAT GGGAAAGGGA 
281 GTAAGAAGCT GCCCTCAAAG TCCCTACCCT 
351 CCATACCTCT GCCTTATATT TTAATGAAAA 
421 GAAACATTTA GTGAAAATCA GAGCTAGAAG 
4 91 AAGACATCAA AGTTGTCTTG TGCAGCAAAT 
561 AGGCTTCTAA GAAGAGGCAG AAGN 



ACGGAGGCTG TAAATAAAGG GGTTGAGGAA GTAAAGTACT 
TTCAGCCAAA GGACAAATGC CTCCAAAAAA AGTTAATTCA 
TCGCTGGTTC TCACATTGAC CCTGAAAGGA CTTTTTTTTG 
CTCTGTATGG TTGTCACTTT TATCCAAAGT CTCATAGCCA 
GTCTTCCATT CGACTATTCT GAGGTTCAGA CCCAGAAACC 
GTATGTCTCC AGGTTTATGT GGAGAATAAC CAAGACCTCA 
GAATCTGTTT TTTTGCGAGT TCAGAGAAAC TGACTTGGAT 
TCTCCTCCGG CACATAGTAG GCACTCTGAT AAATTCAAAA 



SEQ. ID NO:52 

1 GTGAANCCAN NNTAANNCCN ATTGGAGCTC 
71 TTCTTCTTTC CTTCCTTCCC TCCTTCCTTC 
141 CCCTCTCTCT CTCCTTTTTC TTCTCCTTCC 
211 TCTTCTTCTT CTTTCTTTCT TTCTTCTTTT 
281 TAAGTCTTTT CTTTCCCCAT TACTGTTAGA 
351 GCATTTACAC GTTTCGTAAC GGGCGTACTT 
421 CCTTTAGTGA GGGTTAATTG CGCGCTTGGC 
4 91 CCGCTCACAA TTCCACACAA CATACGAGCC 
561 AGCTAACTCA CATTAATTGC GTTGCGCTTA 



CAAGCAGTGG TAACAACGCA GAGTACGCCC CCGATGTAGT 
CTTCTTTCCT TTCTCTCTCT CTCCCTCTCC CTCTCCCTCT 
TCCTCCTCCC CCCAATCCGT TCATGACTTC TTCTTCTTCC 
TCTCTAAGCA GGATCCTGGG CTGTTCAAAC CAGAGAGCTG 
TCCGTTGAAT CGGCTCCAGA AACCAAACAA GTTAACCCTT 
CTGTCGTCTT GAGCGTACTG ATGGTACCCA GCTTTTGTTC 
GTAATCATGG TCATAGCTGT TTCCTGTGGG AAATTGTTAT 
GGGAGCATAA AAGTGTAAAG CCTGGGGTGC CTNATGAGTG 
CTGNCCGTTT TCAGTCNGGA AAN 



SEQ. ID NO:53 

1 TNANNCNNNT TAANNCCCAT TGGAGCTCCA 
71 GCTTCTTCTT CTTTGGAGTG GCTGAATGCT 
141 CTGCAGTCCC TTGCACTACC CAGTCATCAT 
211 TTCCCAGGCT TTCCTGTAGC TACTGTGCAG 
281 AGGTGAACCA CTTCTTCTGT GACAGCCCAC 
351 GATCTACGCC ATCGTCGGAA CCATTCTGGT 
421 CGCATTGCTG CTGCCATCCT CAAGATCCCA 
4 91 CACACCTCCT TGGTGGCTCT CTTTTCTATA 
561 TTCACCTGAG GGCACGAAGC TGCTATCATT 

SEQ. ID NO:54 

1 GTTNTTCCAT GGACTCCCAA GCAGTGGTAA 
71 CTTATCATTT AT AG AT AT C A TTTATTCTTC 
141 AATAATTCCA TATCCTTCCC ATCTTGCTTG 
211 TCTTTCTTCT GTTGGTGATG GCCTATGACC 
281 TGCTGCTGGT AGTGTCCTGG GTTGGAGGAT 
351 CCCATTCTGT GACCTCAATG TCATTGATCA 
421 ACCGATACCC ATGTTATTGG CCTCTTAGTG 
4 91 CTTACTCATC TCTTATGGNN CATCTGCACT 
561 CTCAACCTGC AGTTCCACAT AACTGGGGGG 



AAGCAGTGGT AACAACGCAG AGTACGCCCC CGATGTACTT 
TCCTCCTGGC TACCATGGCA TATGACCGCT ATGTGGCCAT 
GAACCAAAGG ACTCGTGCCA AACTGGCTGC TGCCTCCTGG 
ACCACATGGC TCTTCAGTTT TCCATTCTGT GGCACCAACA 
CTGTGCTGAG GCTGGTCTGT GCAGACACAG CACTGTTTGA 
GGTCATGATC CCCTGCTTGC TGATCTTGTG TTCCTATACT 
TCAGCTAAAG GGAAGAATAA AGCCTTTTCT ACATGTTCCT 
TATCATTAAG CCTCACCTAC TTCCGGCCTA AATCAAATAA 
GCCTACACTG NTATGACTCC A 



CAACGCAGAG TACGCCCCCT ATGTACTTAC TTCTTGCTGG 
ATCCATTTCC CACAGATCGA TTTCAGACTT GTTCTTTGGG 
GCCCAGCTCT TTACAGAGCG CCTTTTTGGT GGGTCAGAGG 
TTGCATTACT TGGTTATCAT GAGACAATGG GTGTGTGTTT 
TTCTGCACTC AGTATTTCAA CTTAGTGTTA TTTATGGGCT 
TTTTTTCTGT GATATGCACC CTTTATTGAA ACTGGTCTGT 
GTGGCAATGG AGGACTAGGT TGCACTATTG GGNTTCTGCT 
CTCTAAAGAA CCTTAGTCAG AAAGGGAGGT GAAAAGCCCT 
TGGTTTCTTC TTTGTN 
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SEQ. ID NO: 55 mmrrrrT AACAACGCAG AGTACGCCCC CAATGTACTT 

1 TTANNCCNNT TNAATNCCNT TGGAGCTCCA ^AGTGGT ^AACGCAG ACGTGGCCAT 
71 GCTTCTTCTT TTTTGGGGCT GCTGAGTGCT GCCTCCTGGC CACCATGG CTGCCTCTTG 

141 CTGTGACCCC TTGCACTACC CAGTCATCAT GGGCCACATA TCCTGTb TCCCTTTTTG TGGCCCCAAC 
211 GTTCTCAGGG TTTTCAGTGG CCACTGTGCA £CCACATGG ATTTTCAGT 

281 AGGGTGAACC ACTTNTTNTG TGACAGCCCT CCTGTTATTG ^ TGCTGATCCT GGGATCCTAT 
351 GAACTGGAGG CTCTTGACAG CCACTGCCTA ATTCATTCTC TTTCCTT^ TNAGCATTCT NCACCTGTTC 

421 TTCGCATTCT CTTCACTATC TTTAAGGATG CCGTCAGCTG GTATTTTCCG ACCCCAATTC 
491 CGCCCACCTC TTGGGTGGCT CTCTCTTCTA TAGCACTGGC AMCCTT ^ TTCCAATGTT 

561 AAGTGCCTTT TTNTGAGAAG CAAAGAAACT GGTTGTCACT 

SEQ. ID NO: 56 rr***rr*n GGTAACAACG CAGAGTACGC CCCCCATGTA CTTTCTTCTT 

1 GNGNTTTNNN CCATGGAGCT CCAAAGCAGT GGTAACAACb ^ ATGTGGCCAT CTGCAGTCCC 

71 CTTTGGAGTG GCTGAATGCT TCCTCCTGGC TACCATGGCA ^GACCGC TTCCCAGGCT 
141 TTGCACTACC CAGTCATCAT GAACCAAAGG JCTCGTOCCA AAC G GGCACCAACA AGGTGAACCA 

211 TTCCTGTAGC TACTGTGCAG ACCACATGGC JCTTCAGTTT G CACTCTTTGR GATCTACGCC 

281 CTTCTTCTGT GACAGCCCAC CTGTGCTGAG GCTG£l<~i TTCCTATACT CACATTGCTG 

351 ATCGTCGGAA CCATTCTGGT GGTCATGATC CCCTGCTTGC TGATCTT ACATGTTCCT CACACCTCCT 
421 CTGCCATCCT CAAGGTCCCA TCAGCTAAAG ^AAGAATAA ^CCT TTCACCTGAG 
JS G— C SS! SSSS S— STGTTGAAC CCCATAATTT ATTCATTCAG 



631 C 



SEQ. ID NO: 57 ^mrTCGT AACAACCGCA GAGTACGCCC CCCATGTATT TTCTTTTTCT 

1 TTATNNCCAT TGGAGCTCCA ^CAGTGGT ^AACCGCA GNGGNCATCT GCAGTCCCTT 

71 TGGGGNAGCT GNATGCTTCC ™CTGGCTAC CATGGNATAT GACCGGC TTCCTGGGTC CCAAGCTTTC 
141 GNNCTCCCAG TCATTATGAA CCAAAGGACA CGGGCCAAAC J GC ACCAACAA GGTGAACCAC 

211 CTGNAGCTAC TGNGCAAGAC CACAATGGCT CTTNAGNTll AGCACTGTTT GAGATCTACG 

281 TTNTTTCTGN GACAGCCGGC TGTGCTGAAA g^TCTGN JGCAAG GNGTTCCTAT ACT CGNATTG 

351 CCATCGTCGG AACCATTCTG GTGGTCAATG ^CCCTGCT CCTTTCTCTA CGTGCTCCTT 

421 GTGCTGCTAT CCCTCAAGAA CCCATCAAGC TAAANGGGAA GCAATAAAGN TGGGCCTNAA 
III SSSSS SS SSSSt SSSISS SSU CACTGGTNGN GACTCCATGN 



631 TGGAACT 



SEQ. ID NO: 58 nrmrTrCTA ACAACGCAGA GTACGCCCGT TCCTCAGACA 

1 GTNATNCCNT TTAATNCCNT TGGAGCTCCA £GCAGTGGTA JJAACGCAGJ ^ TGCAGTAGAG 

71 GTATATGAAT GGGTTAAAAA TGGGCCAGAG ^AGATGCAGG J^J AGAAAGTCTT TAGGAGGTAG 

141 TCAAATCTAG GGCTGATGGT "CTTGGGAT GCATAGTAAT JGGTA^ CCTGGGTTTC TAGCTTGAGC 
211 AATGGACAGG ACTTCACAAT GCATTAAATG TAGGGAGAAA AGTTTGGAAG AGAAAGATGG 

281 TAGTAGGGAT AGTGGTAGAA TTTACTGATA TGGAAAACTG GAGGAAAAAG ^TT CAAAGGAGAG 
351 CAAGTTAAAT ACCTGTGGGA AATATAATCA CAGACACTAA ATAGGCAGCT G ^ TQTATGAGAC 

421 CCATGGGCTA GGAACATACA GTGGGATTCC CTGGCATGTC GCTGGCCAGA GGATGAGAAA 

tl\ SS G T o~ SSSS5 ESSE Sggagaaa NGGTTGAAAT T 

SEQ - 1 I ^;TTA N NC NCTGGAGCTC CAAAGCAGNG GTAACAACGC AGAGTACGCC CATTGCGTAG CGTG g TACATA 
71 AAGGGGTTGG AGCTGAAGGA GGAGATAAAG ^AAGACAG CCAGAACCTT 6 ^ AGG tGGGTGC 

141 GGGATCTTGG GCCGTAGATA GGTATAAGCA AAGGGTGCAT ft GlAb ACAGCAAGGA GAATCCGGCC 

211 TGCAGGTCGA ATAGGCCTTC JTCCTCCCTT CTGCAGAGTG JATGTGGTAG G qt GGAG tCA 
281 ATAGGAACAT GCAATACAAA TGAAGGGAAA CACAAOAAAA GAA AAAATGA TTGATGGCTC 

]\\ SSSSS — S ~I C T SSSSS «««« ™C»TCC> 
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ss ™ ss ss =: sss sss ssss 

631 T 

SEQ. ID NO: 60 a ,TrrMVTC CCAAGCAGTG GTAACAACGC AGAGTACGCC CTCCTTGTTT 

I TGTTANTCCN NTTTNCTNCC ATTG^GCTC CCAAGCAGTG TAGGACAAGG ACACAGACAC 
71 CTGAGAGTGT AGATGAAGGG GJTATAGGAG ^AAAGATCA ™ TTCAGCAGGG GTCTCACATC 

141 TGACAACAAA GTTGATTATC JCATTGACAG TGGTGTCTOT GCAG ^ tagatgttac ttggacaATA 
211 ACAGAAGAAG TGGGAGATGA CAAAGTCATC JCAAAAGGGC AGACACAAGC CCTCTTACCT ATGAATACCG 
281 GCCATGCCCA GGCCAATCCT CAGTGACCCA 6JJCCCMTC £ GAAAG CAGTT GTTGATGCCA 

351 TAAGGGGTTG CAGAAGACCA CATAGCAATC JJATCCCATG GTGAGGATTC AAGAGGCGAG 

421 AAAGTCACAT AGAAGAGCTG JOTGACACAG JCTTGCATGA C ^ ^GGAAAAAGT 

III SK TACTTCTGTC SSESS TACTGATGGT ACCCAGCTTT TGTTCCCTTT 

"VSSSo-. TGTAGCTCCN AA = TTCCTCCTGT SSSSS SSSSS SS 

71 TTGTTCCTAC TCTTTGCTGG ATTTGAAAAC "CCTCCTGT £ CTGCTGGTTC TAGCATCCTG 

141 TCTGTCACCC CCTGCACTAC ATGGTCATTA JGAACCCTCA TGTCCTTCTG CACAGCCTTA 

211 GACCATGAGT GCTCTGTATT CCTTGCTACA ^CTTAATG ™ TTCTGATAGC TTTCTTAATC 

281 GAAATCCCCC ACTTTTTCTG TGAACTTAAT CAGGTCATCC ™ GGGATCCTTT ACTCTTACTC 

351 ACATGGTGAT ATATTTTACA GTTGCGCTGC ™GGTGGMG ^GCT AGGCATTTTC ACCTGTGCAT 
421 TAAGATAATT TCTTCCATAC ATGCAATCTC ATCAGCTCAG G^^ cttagtctgc 

III SaSct CAAGTGCAAC AGCCTCAGTG ATGTACACTG GGGCACCCCC AT 

™VSISS«. TTNATGCCNT njrjgjjg «— [ »C™ — GC JgJXJC^ 

,2 K££ S5SSS T= SSSjS -~ TTTAACGAGT SHE 
2U CAAAGCGACT CAATCAAATA CTCCKGGAA «=«<™£ ^AGCTCT CTACCATTTG GTACTGGTAC 
281 GATTATTACC TGTTGGAGAA ^TAGGCAAC ^TGAATTAA g TGCCAGAACT CGGGCTTTCA 

S SS SSSS SSSSS ~ AAAC^GC, C—GA G2G.GAGGAA 

is ss ss ss sssss sk- tt ™ 

SEQ. ID NO: 63 .,„„„.,- APTACGCCCT CTTGGTTACG TAAGGGAATA GATGATGGGG 

1 TGTAGCTCCA AAGCAGTGGT AACAACGCAG AGTACGCCCT ^ CTCTGCTCGC GTCGGGACGT 

71 TTCAGCATGG GGGTGACTAC AGTGTACATG JCAGTGGCCA CACGGTL NCCAGGTGGG AGCCACAGGT 

II GGCCTGGAAG TAGACTGCAA TGACTGTCCT ^AGAAAGAG GCTCACC GGNAAGCTTG NTATGGNCCC 
211 GGGNCACAAG TCCCGGAGCC TCCCAGAGGC TTGAGGGCAG CTGGAGCA ctngqtgaag ATGAGCAGCT 
281 ACAAGGAGGC GAGGATGAGC AGNAAGGGAG JGACCACCAC ™ GtGAG aGAAG AAGTGGGNGC 
351 TGGATGTGGT GGNTGTCAGA GCACGAGAGC CTTTAAGAGA AGGAGGGAGT TGTCCGTGGG 
421 ACTTTGTGGG AAAGCACAGA ^GACAAGC GAGCCATGAG £ CTCGTGGGAT AAGGGAAGGG 
SI GTGCCGGATN SSS ~CC AGAAGACAGC TTTNAATTTA CCCCAGG 

SEQ -1^tScCNT NTANCTNCAA NNGAGGTAAC gDOCgg TGTAT S JCTWJOCg 

71 CCTGTCCTTT GTAGAGATCT GCTACACCAC CGTTGTGGTG CACACTTGGT GGTGCTGACT 

141 CAGAAGCCCA TTCCATTGGC TGGATGTGGG GCCCAAATGT JCCTCT CCTTTGCACT ACCCCTCATC 

211 GTTTCCTCTT GGCGATCGTG GCCTATGACC GCTATGTGGC ^ CCTCTTCCTC TCCCTGCAGC 

Si SSSS "AS TTGCCCTTCT SSSSS CCAGGAAATT AAGCACTTCC TCTGCGATGT 
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S! SSSS = SEES = SSSE = 3SS5S 

Si gwSgctS gggccggcac caggcctttt caactgctct tccgg 

SEQ. ID NO: 65 .^nrrrrrr rrrAATCTAT AGATGAAAGG GTTTGGNGAG 

1 TGTAGCTCCN AAGNNGAGNT ANCAACGCAG AGTACGCCCG CGGAATCTAT ^AT CCTCAGGTGA 
71 TCAGAAGAAG GAAGTACATG JGAGTCATAA CAGTGTAGGA CAATGATGGC AGCTTCT^ 
141 ATTATTTGAT TTAGGCCGGA AGTAGGTGAG ^CTTAATGAT JTA GAGGATGGCA GCAGCAATGT 

211 GAGGAACATG TAGAAAAGGC "TATTCTTC ""iJJ CACCAGAATG GTTCCGACGA TGGCGTAGAT 

- SESS = SEES Sggj; s — ™ s 

; 2 9 ; ess ss = ssss ssss ™ GC 

561 aga?ggccac atagccggtc atatgccatt ggtagcccag gangaagct 

SEQ. ID NO: 66 „„.„„„„ Rrr , rn rrTA ACAACGCAGA GTACGCCCCT ATTTCTCAGA TATANGATGA 

1 GTTATNCCTT GTTGCTCCCN AGCAGAGGTA JCAACGCAGA aaaAGAAATA GTCTTGTGCT 

71 AGGGGTTCAG AAAAAGAATG AGCAAAGAAA JTCTGGGCCA GGCGGGCATC ATGACAGGTT 
141 CAACCAGAAA GTCTGCAATC ATTTTAGGGG ™GCAGAAGA GGCAACA ccc AGCTTTTGTT 

211 GGCAAGAAGC AAATACATTG GGGGCGTACT TCTGTCGTCT JGAGCGTAC GAAATTGTTA 
281 CCCTTTAGTG AGGGTTAATT GCGCGCTTGG CGTAATCATG GTCATAGCTG ™ CTAATGAGTG 
351 TCCGCTCACA ATTCCACACA ACATACGAGC CGGGAGCATA AAGTGTAAAG <£T GCCAGCTGCA 
\l\ S55SS CC™ T C T G G CG S CO™ EE S TCTTCCGCTT CTCGCTCACT 

s« SSSSS! cgctcggtcg ttcggcttgc ggcgagcggt atcaagctca ctcaaat 

SEQ -i I ^5tac N ctgtgcnccc ccagcagngg taacaacgca gagtacgccc , tg= gaaataaatg 
7i aatgggttta aaatagacgt gaagatggtg tagaatacag caaggacttt gtcaa cagtcaatgt 
141 agggccacac atagatgaaa atacacgatc caaagaataa agtgaccaca gtg gtagganaca 

211 GGAGTGGGCC TTCACCATGC TTACAGAGGA GCGATTCCTA ACTGTAA. TAGTAGGCTG ACAACATAAG 
281 ACCAANAGGA GAAAGGAACT CAGAGAAAGA AAGCCACCAT CTGCAACTAT ™ TTAGGACCAC 
351 TGTCTATGCA GGCTAACTTN GTNGCTAGAG GAAGGTCACA GAAAAAAAC AGGAAATGAG 
421 ANAATGGCAG ATTAACCGTG AATGCCAACT GGCTGGTGGT ATGGATGAAG GCTCATANC C 
iS S££S SS SSSJ TTACTTGCAG TGNNGGAGNG AACATGAAAT N 

— •x'SlSSEc-l TTTAATNCNA TGGAGCXCCA A.GCAGTGGT AACAACGCAG AGTACGCCCC —CTT 
71 GTTCCTACTC TTTGCTGGAT TTGAAAACTT CCTCCTGTCC GTGATGGCCT A GCATCCTGGA 
141 TGTCACCCCC TGCACTACAT GGTCATTATG AACCCTCACC TCTGTbW^ AGCCTTAGAA 
211 CCATGAGTGC TCTGTATTCC TTGCTACAAA JCTTAATGGT AGTACGGCTG TCTTCT 

281 ATCCCCCACT TTTTCTGNGA ACTTAATCAG GTCATCCAAC TTGCTTGTTC ™A CTTACTCTAA 
351 TGGTGATATA TTTTACAGTT GCGCTGCTGG GTGGAGGTCC CCTCACTG^ ^ C?GTGCATCT 

JS ? gc cc?ta mSo Sctaggg gtgaccttag ttttgctgnc acccgcaact 

561 ScaSSg Saacaacc tcagtgatgt acactggggt caccccatgc c 

SEQ. ID NO: 69 „*nmnnr*r TArAfTTACN ANGANTNCGC CNAGCGCGCA NTTNNNCCAG 

1 GNGNNNCGAG NTTANNCCTT GGACTCCCAG JAGAGCTACN ANGANTN^ TAGCTGGAGC 
71 GGTNNTNTTN GTATCACCAA TG^TAGAAA ACAGACACCA CCTTGTCCCT J ^ AGGGAGAGG x 

141 TGGGTCGCAA GTACACGAAA AGGGCTGTCC GAAACAGCAG ^ ACACTATGTA ACCATAGGAG 

211 GTTGCAGGCT TTCCATCGGC CCTCTGCTGA AGGGATCTTC AGGACCGCAG .j. GAG TGAGGA 

SI SSSSKS SS SgSg Saggtt gcagaagaag tggttgatga tgttgggtcc 
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191 rr AAAAfTGA AGCCTAAATA TGGAGCTGGC CTGGATCAGG GAGCTCAGGA AGCCACCAAC ATATGCCCCA 

9 SSSSS ™5££E gcStgggtc ATGATAGTGG ggtanagaag ggggctggag atggcttgca 
561 iSSSS Sgccatagc agtcangagg aggcactcaa gacagaccca tgccgacnaa gaaat 



SEQ. ID NO:70 

1 GNNNNTTTTA CCCCTGNNGC ACANAGCAGT 
71 ATTCTGGACA CGCtACTCCT GACCGTGATG 
141 ATATGATCAT CATGAACCCC CACCTCTGTG 
211 ATCCCTCCTC CATATTTCTC TGATGATGCA 
281 TGCGAACTGA CGTACATCCT CCAGCTGGCC 
351 TGACGGGTGT GCTGGGCGTT TTTCCCCTCC 
4 21 AAGGAAGATG TCCTCATCTG GGGGAAAACA 
4 91 TCTTTATTTT ATGGGACAGG CATTGGGGTC 
561 CCGTGGCCTC GGTGATGTCA CTGNGGTCAC 



GGTNACAACG CNCGAGTACG CCCCCTATGT ATTTTTTCCT 
GCCTATGACC GGTTTGTGGC TGTCTGCCAC CCTCTGCACT 
GCCTCCTGGT TTTTGTCACC TGGCTCATTG GTGTCATGAC 
TCTAATCTTC TGTAAAGATT TTGAAATTCC ACATTTTTTC 
TGCTCTGATA CCTTCCTGAA CAGCACGTTG ATATACTTTA 
TTGGGATCAT TTTCTCTTAT TCACGAATTG CTTCATCCAT 
AATAGCACTT TCCACCTGTG GGTCTCACCT CTCCGTCGTT 
CACTTCACTT CTGCGGTGAC TCACCCTTCC CAGAAAATCT 
CCCCATGTTG ACCCTTTCAT TTACACCCTT AGCAAG 



SEQ. ID N0:71 

1 GNNNNNNNNN GTTNATNCCN NTTTTAATGC 
71 CCAGGGTTCA ACCNNTGAAT AGAAAACAGA 
141 CGCAAGTACA CGAAAAGGGC TGTCCCAAAC 
211 AGGCTTTCCA TCGCCCTCTG CTGAAGGGAT 
281 AGTTGGAGGA ACGATGTTCC TCCGACGGTG 
351 CAG AGCAAG A CAGAGCCAGG ACTGGTGGGG 
4 21 AAAAGTGAAA GCCTAAATAT NGAGCTGGCC 
4 91 CCACCATGCG TGTACAGAGG CCCTGGGTCA 
561 TATCGGTCGT TGCCATAGCA AGTCAGGAGG 
631 GNC 



CANTNGAGNT AACAACGCAN GAGTACNCCN NNGNGTACGC 
CACCACCTTG TCCCTGCCTA GCAAGTAGCT GGAGCTGGGT 
AGCAGAGTCA CCACCATCAG ATGCGAGGCA CACGTGTTGC 
CTTCAGGACC GCAGACACTA TGTAACCATA GGAGATAAGG 
ACCACCACGA GGAAATTCAC CACTTGACTG AGGAAGGTGT 
AGGTTGCAAG AAGAAGTGGT TGATGATTGT TGGGTCCCGC 
TGGATCAGGG GAGCTCAGGA AGCCACAACA TATGCCCCAA 
TGATAGTGGG GGTNGAGAAG GGGGCCTGGA GATGGCTGCA 
AGGCACTTCA GACAGACCCA TGCCNCNAAG AAAAAAAACT 



SEQ. ID NO:72 

1 GNNNNNNNNN NTTNNNNCNN TNACTCCNGC 
71 ACTAANGGTA ANNTNAGCTG GAACACATCA 
141 GTCACCCCCT GCACTACATG GGTCATTATG 
211 CCATGAGTGC TCTGTATTCC TTGCTACAAA 
281 AATCCCCCAC TTTTTCTGTG AACTTAATCA 
351 TGGTGATATA TTTTACAGGT TGCGCTGCTG 
421 AAGATAATTT CTTNCATACA TGCAATCTCA 
4 91 CTACCCCTCA GTTGCTNCTT ATTTTATGGN 
561 CTACACTCAA TGCACAAGCT CAGTGATGTA 



AGTGGTAACA ANNANTACGC NCAGCGCGCA GTTAACCCTC 
NTACGNTCAN GNNNGCNCNA TGACCGGTTT GTGGNCATNT 
AACCCTCACC TCTGTGGACT GCTGGTTCTA GCATCCTGGA 
TCTTAATGGT AGTACGGCTG TCCTTCTGCA CAGCCTTAGA 
GGCATCCAAC TTGCTTGTTC TGATAGCTTT CTTAATCACA 
GGTGGAGGTC CCCTGACTGG GATCCTTTAC TCTTACTCTA 
TCAGCTCAAG GGGAAGTCAA GGCATTTTTC ACCTGTGCAT 
GCAATCCTAG GGGTGACCTT AGTTCTGGTG GCACCCGCAA 
CACTGTGGCA CCCATGCTGA ACCN 



SEQ. ID NO:73 

1 GTNNNNCNNN TTGATTNCCA TTGGAGCTCC 
71 TTTTCCTATT CTGGACACGC TACTCCTGAC 
141 CTGNANTATA TGATCATCAT GAACCCCCAC 
211 TNATGACATN CCTCCTCCAT ATTTCTCTGA 
281 TTTTTTTNTG CGAACTGACG TACATNCTCC 
351 ATACTTTATG ACGGGTGTGC TGGGCGTTTT 
421 TTNATCCATA AGGAAGAATG TCCTCATNTG 
4 91 CCTCTTCCGN CGTTTCTTTA TTTTATGGGG 
561 ACCCCTTCCA GAAAANTTTC CGTGGGCNTC 



AAAGCAGTGG TAACAACGCA GAGTACGCCC CCTATGTATT 
CGGGATGGCC TATGACCGGG TTGNGGCTGG CTGCCACCCT 
CTNTGTGGCC TCCNGGTTTT TGNCACCTGG CTCATTGGTG 
TGATGCATCT AATCTTCTGT AAAGANTTTG AAANTNCACA 
AGCTGGCCTG CTCTGATACC TTCCTGAACA GCACGTTGAT 
TCCCTCCTTG GGATCATTTT CTTCTTATTC ACGAATTGNT 
GGGGAAAACA AATAAGCACT TTTNCACCTG TGGGNCTCAA 
ACAGGCATTT GGGGTCCCAC TTTACTTTTT GNGGNGACTC 
NGGGATGTAC ACTGGNGGCA CCCCCATGTT GAACCCTTTT 
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FIGURE 2 

S^'nJStncntt naccctc.ga tjctjctcj. jcjjccjjc jjjtjjjj;. « 
Jatctgcaga attcgccctt ctgtt.cgc. ^atatata £JW£ a ? a gggttga 180 
ataaatgggt tactgaggaa ""Jaaatg ^"actg g J nacttagggg 240 

aagaactgta aaatagaaaa aggaccttnt gctgctcctc agg yy yy cctca g C ccg 300 
ccatgtacat gacgatgnng ctgccnntna ag.gtcccac tnctcancng c g 9 ^ 
nctttttnct cacnnnccnt ntttntctnc cctcttnnnc tctttncttc nntcn 42Q 

cccttccnct cctccctttt gcntnaccat tgnccctnat ccctttaat ^ 48Q 

tctcccctct attccttcnn tnttcgnctt "^ctctnc ctctttc cett 540 

ctcntctnct cttcctctng tcatcctngt tcnttctctt ^ctantt ctc 600 

ntcttattnc tcctctatnc cctctcatct "cntctcnt jctctcntc 66Q 
nnctcttccn ctccgtcntc cnctttctct tcntnacgcc ^ccctc tatn 120 

ntctcntcct cactctctcc tctccctncn »t«ctntt ctccncct 180 
ctcncnttct nncttnactt tgtcacgctc "ctcctcct ctct.cg atn 840 

ttatctcncn catcnccctc nnttctncac nctattnact cttttctcn ^ 
ctSctntcnn cttanatcnc ctcccttctn tnanccnntc actgcn 

SEQ ID NO: 112 „„-„„ na tn taccnnnntn gcnnttagat 60 

gctgctcgag cg.cgcagcg tcggcagtgt -gggnnatn tgccnn * 120 
nanaggnntn agtatggggg ^ ac «cagt W^acataac 9 taaa ataagg 180 

gtgagttgcg ggtggcagca gaactaaggt acacccctag g« 9 tgagctgatg 240 

agacLctga gaggtgagat ;c.c.ggtjg - *g£ccc. gJcaggggac 300 

agattgcatg tatggaagaa attatcttag agtaag g ^ ctatcagaac 360 

ctccacccag cagcgcaact gtaaaatata tcaccatgtg J aa ggctgtgc 420 

aagcaagttg gatgacctga tt.agttcac -J""^, ctnatggtcc 4 80 

agaaggacag ccgtactacc attaagattt ^agcaagga y y gtagngtcag 540 

SEES SSSSS — «S 

tantccacca cnnntnttct nannc 

SEQ ID NO: 113 tctqcagaat tcgcccttcc 60 

catgcnngag caggctcgag cgccggcagn ^gagggata ^tgcag g ^ ^ 

tatgtacttt ttcctgagcg tatacacaat cccatcatgt actgggg g J 18Q 

tatcattnac aagctgnctt tngcagatgn ^tttgnttt ctcatt gg 9 240 

acttcctcct ggcagccatg gcttatgacc 9ctgtcttgc "tctgc 3Q0 

acggagccat catgagtagc ctgctctcag cgcagctggc «tgggctcc ^9 3fi() 

gt?tcgcgcn cantgcagcg cccacagccc tcagnagcgg Jcttgctctt ^ 

S2SS 2K f : s SESE as 
SSSS =SSS 5— — £ 

nnnccannnn tnngtgcgcc gctcc 

gStaagSc Slcctctn gangcatgct cgagcggccg ccagt gtgat jjjtjt^c « 
agaattcgcc cttgttccgc "acaataga tg-aggatt aagtg^gg 
catagaagag accaaagaac ttgcccctcc ="999 aaaatqq qag gagcaggtcc 240 
agacagcnat gactgagctg tagaagaggg ^accacagt ^gatgggag ^ ^ 

sjkss r : ? ~ = 3,0 
ssss .nssss ss s» ^5—» - 
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cgtntcnnac ctcttatttt 
cgccanttcc atncncntct 
tattctcttt ctcttgnctc 



tnttncccct ttctttcgct 
ntcnnttttt ttntntnacc 
tcccttctct ctcntnttcc 



cnntccccnt tntcccnnct 540 
ntnttntcat ntcntctctt 600 
canctctccc g 651 



SEQ ID NO: 115 
ggntctcggt acaanacttg 
atatctgcag aattcgccct 
ttcctccttg tggccatggc 
accgccatca tgagccccat 
accttccatg ccatgttaca 
ttgttgntnc cccactnnnc 
tnctccnnga gtcctcnggn 
atcctttctt tanttntcca 
anctcctncn tagcttactn 
ttctctcncn tntctctcnc 
tcntnncnct tatttantnt 
nnnctcntcc gannntgntc 
ttcncgncat ntattncaca 
gcnnctacng ntgncnntcn 
cccctctccg 



gccctctaga tgcatgctcg 
tccaatgtat ttattcctgt 
ctatgaccgc tatgtggcca 
gctctgtctc gccctggtgg 
cactttactc atggccagnt 
tntgtnntna gtctnctctn 
nncgtngtcg nttncngcnn 
tntnttcact nattnctctt 
tttcntgctc tccngngctc 
tnnnncccnn nnntcntcgt 
ctcnccnctn tctcnngctc 
atggcatctn cacattnngc 
tccacntgca ctcctactcn 
nccgctcntn cggccncnat 



agcggccgcc agtgtgatgg 60 
tatttggaga cctggagagc 120 
tctgcttccc cctgcactac 180 
cgctgacctg ggtgctgacc 240 
tgtgcttntg tnccnnacna 300 
ccntnnactg ctcntcctct 360 
tcaattgcan tnccnncntc 420 
tatccncnnt ntcnccctcc 480 
anccttttcn ccatatnttc 540 
ntctctgctc cntcttnacg 600 
cancgcngta ccngccctat 660 
cctactatnn ncgatctatn 720 
ctctctancc nccgtacatc 780 
nnctccactt tntctnggtc 840 

850 



SEQ ID NO: 116 
gatgcatgct cgagcgcccg 
ctttttcctg aagaacctct 
atccatccgt aactccctga 
agcctatttt ttctctgcct 
tgaccgctat gttgccattt 
gtgctatcag atggcagtca 
tggcaacatg tttcgggagc 
catccctcag gtgttggccc 
ccctgagcct caatgcttgg 
anatcttctn aanggggctc 
cccnnctgct tccccccacg 



cagtgtgatg gatatctgca 
ctgttttgga tctgtgctac 
ctcgcagaag ctccatctct 
ttgcatctgc tgagctggcc 
gccaccccct ccaatacaga 
ccacctggct aagctgcttt 
acgtttgcag atccaatgtg 
tggtttcctg ngaggttttc 
ntctgggatg ctttattccc 
nagaatccct tnaggaccag 



gaattcgccc ttccaatgta 60 
atctcagtca ctgtgcctaa 120 
tatcttggct gtgtggctca 180 
ttccttactg tcatgtctta 240 
gccgtgatga catcaggagg 300 
tcctacgcag ccgtccacac 360 
atccaccagt tcttccgtga 420 
tttgtagagc tttgacccng 480 
atgatgatct ccnattttcc 540 
antcnagcta aaagcctttn 600 
' 620 



^cnctSg -Usctc gagcggccgc cagt f gatg jjtjtetjc. gaattcgccc « 
ttccaatgta tttgttcctg ttatttggag ^ctggagag cttcctcctt g gg W 

cctatgaccg ctatgtggcc atctgcttcc <=cctgcacta "ccgccatc UQ 
tgctctgtct cgccctggtg gcgctgtcct gggtgctgac «ccttccac 

acacttLct catggccagg ttgtgttttt f^agacaa tgtgatcccc "cttt ^ 

gtgatatgtc tgctctgctg aagctggcct tctctgacac tcgagttaat g ggg j ^ 

tsszz sss " - kr,s z 

SEES SSSSS £ 

ctgtggtgac ccccatgctg aacccctt 



SEQ ID NO: 118 
gatgatgctc gagcggncgc 
ttgttcctga gcaacctctc 
gcactggcca tcctactggg 
atgnacnntg ttttctcant 
gacgcgcntn cnncnntatc 



agtgngatgg atatctgcag 
cttcctggag atttggtata 
gagaagacag accatatcat 
angccntaca gngnncatgt 
nnnntntnct ntnacncnac 



aattcgccct tcccatgtat 60 
ccacagcagc agtgcccaaa 120 
ttacaagctg ccttttgcag 180 
ttncgcnngc cntgacttat 240 
ttcntcatna tntgnncntn 300 



17/26 



WO 01/27158 



PCIYUS00/27582 



nnttcncctn tggcnnctcn nntcncggnc ttncctatgn acgtcntcnc ccttnggcct 360 
gcatctctnc ntnntcctnn ccnncgnnct ntctttcctt ^taccnt J 480 

tccctcccct ctctgnntgc nnctncnncn ""tnnntg n«tg.tcnc 
ccatcnngtn ctnttctctc gtntcttctn cncgccncct 9catcactgn J 6Q0 

sssr. sssssj ssss ;™ ;s 

=~ »™« SSSS ™« ;» 

cnc 

SEQ ID NO: 119 . nata( ., ta tctqcagaat tcgcccttcc 60 

nntagatgca tgctcgagcg gcccgcc.gt gtg.tgg.ta tctgc g 12Q 

tatgtatttc ttcctggcca acctgtcctt cttggagacc ^ cactctgtat 180 

gcccaagtta ctgtttagtt tttggtctgc S^^agc !""qcttc tggccgccat 240 

gatacaactg tacttcttca ttgctcncat ^acagaa tgcgtgcttc ^9 9 ^ 

ggcctatgac cgntatgtgg ncatctggcg ccca J£~ g ££ gncanctccc 360 

atgggctcct gctccnncct cgctntnnna tanngaaccn J«gngtag J 42Q 

tgt.cgagaa tctacttcat entnetgcct ^gcnaca aganactgtt 480 

cttngntctg nggacatttn ctccagnant Jnaantctct J cntntctanc 540 

cnttancttg annatnttcn ggnacattnt ^tanggnn ttgg g g nctncccntt 6 00 

accngcactn cncantaant gctncngttc tantcngtgc cattcntg g 66Q 

tcatngcntn ncctcccncg aaagcnaant aagtnggngt cttnacttt y ^ 



ncatcncant ggcc 



fgccc^ag aJgca.gctc gagcggccgc cagtgtgatg gatatctgca gaattcgccc « 
Scet.tgt. ttttttcctg ttatttggag acctggagag cctcctcctt J £ 18Q 
cctatgaccg ctatgtggcc atctgcttcc "Ctgcacta «ccgcc J g ^ 
tgctctgtct cgccctggtg «cgct,tcct ^^gctgac «cct ^ actttttct 3 00 

acactttact catggccagg ttgtgttttt ^gcagacaa g « gaatgggtng 360 

gtgatatgtc tgctctgctg aagctggcct - tegjgt^ J gggtcctat 420 

atatttatca tgngagggct cattcttgtc atccc aanq ggtatc tgcaaggcct 480 

gcgagaattg tctcctccnt «tcaaaggc -cttcttct janggg J tat 540 
tctctacttg gtggctcccc ccctgnctgt ^gtcactg ™ aatgg 60 0 

3SS£ ESSE g ~ c ESSE ccc 

IWc Jgojcjc c S;S gatatctgca gaattcgccc » 

ssss suss ssa= =~ =s=s sss= ' s 

tgctctgtct cgccctggtg gcgctgtcct ^ctgac -cctt^^ 3Q0 
acactttact catggccagg ttgtgttttt ^gcagac g * gaatgggtga 
gtgatatgtc tgctctgctg -gct^gcct tctctgacac tcgagt J gtc ? tatgc 
tatttatcat gggagggfctc attcttgcat cc "" aaa tatctqc aaggccttct 
aagaattgnc tccttccatc tcaaggnccc "nttctaaa WJtatctg ^ 
ctanttgtgg ctcccaccct gtcttgtggn tggcactgnt tctaatggga 

SSSS SSSS ccnatcattc ?tac n ng 



360 
420 
480 
540 
600 
657 



SEQ ID NO: 122 
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atgaccctna gatgcatgct cgagcggccg cc.Jtgtgat ggatatctgc a^ttcgcc 60 q 

cttUaatgt atttgttcct fccaacctg tcqtttttgg £.ttw ttatggggag 160 

ataattccca atatgctaga tcatattagc jcaggaatta ag 9 ctt tctcctg 240 

tgtctgacac aactctattt ctctggccta "tgcagatc W tgcccta acc 300 
gctgtgttgg cccttgaccg ctatgtggcc Jtc.jcc.tc ctctccat 360 

atgaactccc aacgctgtgt cctgttggtg J^gtgtcat ggg g tatta tccct 4 20 

ISSUS U c « ? »c «. 

ssss «™ ssee ssn£ «™ — s 

gggcccttnt ccagcntcnc nn 



SEQ ID NO: 123 frn , r cttcc aatgtatttg tttctgttat 60 

gcgncgcagt gtgatggata tctgcagaat tcgcccttcc J t 120 

ttggagacct ggagagcttc ctccttgtgg «atggccta ctggtggcgc 180 

gcttccccct gcactacacc gccatcatga ^cccatgct g g gccaggt tgt 240 
tgtcctgggt gctgaccacc ttccatgcca tgttacacac • ctgctgaagc 300 

gtttttgtgc agacaatgtg atcccccact "ttctgtga tatg g tcattc 3 60 

tggccttctc tgacactcga gttaatgaat gggt gatatt tccatcctca 420 

t?gtcatccc attcctactc atccttgggt cct.tgc«g -ttgtctcc 48Q 
aggtcccttc ttctaagggt ^ctngcaag 9~ttctct. cttg gj ataagt 540 

ftSS^ ^gac ccccatgctn 600 



aaccccnttn 



SEQ ID NO: 124 ^ rn ... fltl , taatqqatat ctgcagaatt 60 

ccttgggccc tctagatgca tgctcgagcg ^ccgccagtg ^gatgg | 12Q 

cgcccttctt tattcctgag tgaatatatg ««gctj ^JJ^., 18C 

aggaaaatgg aaactagacg aacgtgacaa ^ccacgtgg J ggatggtcat 24 0 

ctgaatgcca aagggcaggt cacagaggag 9«g a ccagc actctg g ™ 3 00 

gtacagcctg gtcaagggca tcttccggga tccacaaagg atcctga g J a 36Q 
gctggacccg cagagaacca cacataaaaa ^tcagccat gtg g g cca 420 

?gtttcacac caaacagaat caagcaccac tagacaggaa gccac g ^ 
ggatgctctg cagcagggac agggcccaga ^aggacaca cga 9 tt 540 

Sgngtggct gcnagctctn cttaggatng ^cccaagga ttgncc gg £ t 600 

SSSS S» SSSS ™ct tcccttgtcg 660 

EcSJS atgcatgctc gagcggccgc cagtgtgatg gatatctgca gjjttogcec « 

?tcctatgta cttcttcctg "jtttgw -^gj atgagcccca 180 

cctatgaccg ctatgtggcc atctgcttcc ccccgcao _ acc t tccat gc catgttac 240 

tgctc?gtct cgccctggtg gcgctgtcct ^gctgac «ccttcc.t * ^ 3QQ 

acactttact catggccagg "gtgttttt Jtgcagacaa tfltj gaatgggt ga 360 

gtgatatgtc tgctctgctg aagctggcct tctctgacac JJ gggtcct atg 420 
SSSSS X2XS cttcttc t aa ggtatctgc aaggcctt. 4 jo 

f a ™ s=s ssas sssss 2s=.a s 

acactgtggt gacccccnat gctgacccat tc 
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315 fSS S5S HI is SIS I 

gcctcatcac ctttjtctcc tjtgtjg-c. 39 ac "tccttc, J M 48Q 

gncagtgccc g agna.aag ^""ccac cgnccggatt ntccattcaa aagatgncct 540 

SSSSS SSSS: c!,..S~t .nngtngact t cangn ttt a 600 

aaactccttt natntcactn ttangggaac naggggcgnn ac 



SEQ ID NO: 127 
ntgngccctc tagatgcatg 
gcccttccca tgtatttatt 
ttacacaact caccttcttc 
tcaacttctc aaccttgcct 
tatccctata tttggttttc 
cctcctccat tccaagaatt 
cntctnaccc gnnagtcgnt 
cagcnttngt cancnttctc 
tctcntncnc tttctgcctc 
gccagcnccn nangntccnc 
tnnctcatnn nnncgcncnc 
cgcctacgac ctccctgtnc 



ctcgagcggc cgccagtgtg 
ccttagcctg ttggattccc 
aagaatgtgg aaanctataa 
gttctgacag catcatcaat 
ttcccattnc agggatcctt 
ccattgncag acgggacgna 
tntttatctn tgnantnccc 
cncacnnntt cgtcgntgtt 
ccctccanng tctnnctttc 
ccctctccct cntgtctnct 
ncgctctccn cccntntctn 
ncnnccgg 



atggatatct gcangaattc 60 
agctgcacag ctggattgtg 120 
ttttttttct gtgacccatc 180 
aacatattat gtattttaga 240 
ttgncttacc atanaattgt 300 
tnangccttc tctacctgtn 360 
tngggcgncn nccctgncct 420 
ncccagtnct gtnctnctnc 480 
tcagcnccct tnngncncnt 540 
cnctcctntt cttcntntcc 600 
tacqactccn gncgtctctn 660 

688 



SEQ ID NO: 128 
gcgtgctgcn agcggggcgg 
atttctttct aagcaactta 
atatgctcac tgacttcttc 
agtttttttt ctttgtcggc 
acgaccgata tgcagccatc 
tctgtacacg catggtggtt 
ccagnnccat atttaggctt 
acctccacca gtcctggctc 
tntcccgtgg tgntcacntg 
cangggntac catagngtct 
gnaagccctn ccaccgccc 



cagagtgagc ggatatctgc 
tctttcattg acatctgcta 
tgggagcaga agaccatatc 
atgggtctgt ctgagtgcct 
tccagccccc ttctctaccc 
gnggcatatg ttggtggctt 
cacttttgcg gacccaacat 
tgtcttgctc tgacaccttc 
tcgngaggaa acatcgnttt 
gcgngtccct gaagaatcct 



agaatncgcc cttccgatgn 60 
ctcttctgct gtggctccca 120 
atttgtgggc tgtgctgctc 180 
cctcctgact gctatggcat 24 0 
cactatcatg acccagggcc 300 
cctgagctcc ctgatccagg 360 
catcaaccac ttcttctgcg 420 
cttnagtcaa gncgncgaat 480 
cctccaaccc cttantctcc 540 
tttngccaan cgggcgaatn 600 

619 



SEQ ID NO: 129 
gcggcgcagt gtgatgntat 
cttatctttc attgacatct 
cttctgggag cagaagacca 
cggcatgggt ctgtctgagt 
catctccagc ccccttctcn 
ggtngcgccn tatgntngtt 
ntaggctcna ccnntcgngc 
ctccancacg ttcctggctt 
cncctganct gcnnnttctt 
ttnccatnct gctcnatnct 



ctgacgaatt cgcccttccg 
gctactcttc tgctgtggct 
tatcatttgt gggctgtgct 
gcctcctcct gactgctatg 
accccactat catgacccag 
gnctncntng agctccctga 
tcccgntcca ncancnaacc 
ttctgcnntc gcctcncgnc 
ccangccngc ncgcncancc 
nctctcatcn nttcctantn 



atgtatttat ttctaagcaa 60 
cccaatatgc tcactgactt 120 
gctcagtttt ttttctttgt 180 
gcntacgacc gatatgcngc 240 
ggcctctgta cacgcatgga 300 
nccannnctn ntcacntatt 360 
ccnttcgttc ctgnanactt 420 
nnccttatnc ttnangntca 480 
cgntctntct gnngaancct 540 
ctctccnnct cncgctcnnt 600 
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nncttncnct ctnaacctnt cnnatcctca cctnngatat cctcncgntc tttcgncntc 660 
nttcnctgtc cganntcctc anacncntcc ctanncg 



SEQ ID NO: 130 
ctctagatgc atgctcgagc 
tatgtattta ttccttagcc 
ccctaagatg ctgatgaaca 
ttcacagaca tattttttca 
ggcatataac aggtatgtgg 
gagccagtgt gtcatgctgg 
gcgtaccctc ctcctggccc 
ctgtgacctt ggtgccctgc 
aatctttaca ggagcattga 
tgggcanatn tgggggtcac 
tggccacttg tggnncccnc 



ggccgccagt gtgatggata 
acttggccct cactgacatc 
tgcagactca gcacctagcc 
tattttttgc tgacttagac 
ccatctgaca tcctctacat 
tggctgggtc ctgggtcatc 
agctttcctt ctgtgctgac 
tcaagttggc ctgctcagac 
cnggcattat gcttccattc 
cattctncag anttccttta 
tcncg 



tctgcagaat tcgcccttcc 60 
tccttttcat ctgtcactgt 120 
gtcttttaca agggatgcat 180 
agtttcctta tcacttcaat 240 
tatgccacca tcatgactca 300 
gcttgtgcgt gtgctctttt 3.60 
cacatcatcc ctcactactt 420 
acctccctca atnagttagc 480 
ctgngcatcc tgggttctta 540 
ccagggcatn tgcaangcct 600 



SEQ ID NO: 131 
ttggcctcta gatgcatgct 
ttgatacatg attgggttgc 
tcgggttgcg gaaggaataa 
gaaggaataa atcatcgggt 
tacatcgggt tgcgtaagga 
cgtaaggaat aaatcattgg 
atctttgtgc tggtaccgat 
ggcgaattcc agcacactgc 
gatgcntagc ttgagtattt 
gaccnagctt gncttccttg 
ncanaatnnt nncccngntn 



cgagcgccgc cagtgtgatg 
ggaaggaata aatcatcggg 
atacatcggg ttgcggaagg 
tgcggaagga ataaatacat 
ataaatcatt gggttgcgta 
gttgcgtaag gaataaatca 
ctatcatggg gttacgaaag 
cgnccgctac tagtgggatc 
taacgccgcc aacctaaaat 
cgtnaanttt cnttattcct 
ancacncann ttntatannc 



gatatctgca gaattcgccc 60 
ttgcggaagg aataaataca 120 
aataaataca tcgggttgcg 180 
cgggttgcgg aaggaataaa 240 
aggaataaat cattgggttg 300 
ttgngttgcg taaggaataa 360 
ggaagaaata cattggaang 420 
cganctcggt accaagcttt 480 
ngcnttggcc ttacncnntg 540 
cctntntntc ttctcccccc 600 
ctngngctcc cctantc 657 



SEQ ID NO: 132 
tggcccncta gatgcatgct 
ttcctatgta tttattcctt 
tcacacctaa gctcttggtc 
gtgtggtcca gctcgccttc 
ccatggccta tgaccgcttc 
ccagggggac ctgtctccag 
ctatccagac tggaaatgtc 
actactgtga cataccaccc 
nggncctcna tgncttttct 
taccggcttg ggcttggggg 
ggactcccca cttntgcctc 



cgagcgncgc cagngtgang 
aatgtcctct cgcttcttga 
aacttcctgg tctctgacaa 
tttgtagtgc atgtgacagc 
ctatccatct gtcaacccct 
ctggtagctg tgtcctatgc 
tttgccctgc ctttctgtgg 
cttctccacc tggcttgtgc 
gntctggcac ccttctggcn 
ccaatnggga ggatgcgcct 
ccnn 



gatatctgca gaattcgccc 60 
tatttgttac tcttctgtgg 120 
gtccatctct tttgagggct 180 
tgagagcttc ctgctggcct 240 
ccattatggt tctatcatga 300 
atttggtgga gccaactccg 360 
gcccaaccag ctaacacact 420 
caacacagcc acagcaagag 4 80 
gctgcaggca ttctcacctc 540 
caagaacagg gagggagaaa 600 

624 



SEQ ID NO: 133 
ggagttgata tgaacgggtt 
ttgcccctcc cttgggcata 
cagaagaggg tgaccacagt 
gtggnagagn taattctcag 
gctgagaggc acaaccacga 
ggnatctcca caggagagtn 
tgctggcggg nacagatgcc 
canacccatt atctcangcc 
tngntgntnt ncnnnattnn 
ncncanantt atggnncncc 



aagtgaagga gtgcccactg 
cggatttttg ggctggaggt 
gagatgggag gagcaggtcc 
cactgcctgg gcagtcggct 
agatgacact ggacacangc 
gaatcagaga tgggacnttc 
caagcgnnan ggngntatgg 
acatgtatnt cagcntttna 
ccnntctttn tcccntcann 
cgnacncnct cngtnactcc 



catagaagag accaaagaac 60 
agacagcaat gactgagctg 120 
naaaggcctt tctccatgct 180 
ncataagagg caaggatgag 240 
caactgtatc cattgtagga 300 
acattaanaa gttatttatn 360 
tnctggncna ttnnttcgtc 420 
ntcncnntnt nagtntagtc 480 
tatcattntc attccttncn 540 
cctnnngncg 590 
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5 *VrZ ^cct cjtccctct. fl .t S t S =cll ™- g 0 
SUES SSSS cc joojjttc. jjjgte;- «. 

ctgcctcagc ctcttgagta QCtggaatta "gatgtgag ccaccatg 300 

agatgttttt gaaacaaccc ccaccagcac tggagggagt caaggg g accac 360 

atctgagctc ctctgtctct gcctttcctt ^cactgtcc ccagggta * 420 

ccccatcacg aaccccttca tctacacatt acgtaacaag jgcgwtt 9 tattcta 480 

cggccgttac tagtggatcc gagctcggta ccaagcttga tgcatagct J J ^ 

5i2 E= S= iil a= sr- a 



SEQ ID NO: 135 
ttgngccctc tagatgcatg 
cccttcctat gtacttgttt 
cagcagtgcc caaagcaccg 
gctgtctttt gcagatgtac 
cagccatggc ttatgaccgc 
tgagtagcct gctctcagcg 
attgcagtgc ccacagccct 
cacttcttct gtgacattgc 
aagagcttgt ggcctttgng 
ccntttntcn nctatgnngt 
cnagtggccc ggcgngcnaa 



ctcgagcggc cgccagtgtg 
ctaagcaacc tctccttcct 
gccatcctac tggggagaag 
tttgttttct cattaggctg 
tgtcttgcca tctgctatcc 
cagctggccc tgggctcctg 
catcagtggc ctgtccttct 
accctggant gccctggcct 
aatcgcctgn tggggctanc 
acantcatta agcnccaatc 
ngncctnctc cccgtnccn 



atggatatct gcagaattcg 60 
ggagatttgg tataccacag 120 
tcagaccata tcatttacaa 180 
cacagagtac ttcctcctgg 240 
tttacactac ggagccatca 300 
ggtggtgtgg tttcgtggcc 360 
gtggttcccg tgccatcaaa 420 
gcaccaacac cacaggcagn 480 
cttnngtcat gccctnatca 540 
nctcatggga tccccttttg 600 



SEQ ID NO: 136 
tgnccctcta gatgcatgct 
cttccgatgt atttgtttct 
tccatcccca aaatgctggc 
tgtctngcac agctatattt 
gtgatgccat angaccgtta 
agtccccagc tctgtgcact 
gctgcacatn ctgttnccnc 
cntatcannt tcngctnccc 
ctttcttctc tcnccntnct 
tnctctcnct cccnctcacc 
cccctctccg cntcccctga 



cgagcggccg ccagtgtgat 
agccaacctg tcattaactg 
caacattcat acccagagtc 
cctccttatg tttggnggcc 
tgtggccatt tgccaaccac 
antgctgcnc gtgtgctgng 
ccccncgngg nctctttnnn 
tttcttctcc ccccnnttct 
canatnatca gtccnacctc 
ngntngtcta gtctgccgtc 
tcgtcctngt ctaccctcnc 



ggatatctgc agaattcgcc 60 
atgcttgntt cacttctgcc 120 
agatcatctc gtattctggg 180 
ntgacaactg cctgctggct 240 
cccattacag cacatctatg 300 
tgcnanccan ttgtctgcct 360 
ccgnacccnc cctacaantc 4 20 
tnccnccttc ctcnnnccta 480 
nccttcnttt cttcactnan 540 
gccccntcgc tatcnctncc 600 
catctnatcc ctcc 654 



SEQ ID NO: 13*7 
ctctagatgc atgctcgagc 
aatgtatttt tttctaagca 
gcccaaagca ctggccatcc 
tttgcagatg tactttgttt 
ggcttatgac cgctgtcttg 
cctgctctca gcgcagctgg 
ctcctagcnc tcatgnnnnc 
nacacttgta cctcncgnct 
gccttntctc cccttcgctn 
ttngacccct ntanncncnc 
ntgtcnnccg antangntac 



ggccgccagt gtgatggata 
acctctcctt cctggagatt 
cactggggag aagtcagacc 
tctcattagg ctgcacagag 
ccatctgcta tcctttacac 
ccctgggctc ctggncgtgn 
cttgccttnt gggnccntgn 
tgccctnnnc tgcttctaan 
gttnatcttn anntncntgc 
tcttcttcnn anntcccctc 
ntntcacnnt ntntcnnctn 



tctgcagaat tcgcccttcc 60 
tggtatacca cagcagcagt 120 
atatcattta caagctgtct 180 
tacttcctcc tggcagccat 240 
tacggagcca tcatgagtag 300 
ggcttngtgn cnttgcngcn 360 
nnatcaccct nttnctctgt 420 
tccctnngtt gtantncctn 480 
ntctntgncc ctctccttcg 540 
tatcncnccg ntnncctccn 600 
ctctcctaac tcttnccg 658 
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SEQ ID NO: 138 
ggccccctag atgcatgctc 
ttcccatgta tttgtttcta 
cagtgcccaa agcactggcc 
gtcttttgca gatgtacttt 
ccatggctta tgaccgctgt 
gtagcctgct ctcagcgcag 
tagtgcccac agcccntatc 
ttctttctgt gacatttgcc 
nngnttncnn gnctcggcnc 
tttaatgncc ttaatnaaac 
ttcnaannna cccttttttc 
nnngtcnncc 



gagcgggcgc cagcgtgatg 
agcaacctct ccttcctgga 
atcctactgg ggagaagtca 
gttttctcat taggctgcac 
cttgccatct gctatccttt 
ctggccctgg gctcctgggt 
agnggcctgt ccttttgtgg 
cccctgcntt ncccntggcc 
cccctttgac ntantncntt 
tctcnctctt catgttnttc 
catnnncncg tctacntcnc 



gatatctgca gaattcgccc 60 
gatttggtat accacagcag 120 
gaccatatca tttacaagct 180 
agagtacttc ctcctggcag 240 
acactacgga gccatcatga 300 
gngtggnttc gtggccantg 360 
ncncccgtnc catcaacccc 420 
ctnccccaan cacngcangg 480 
gntgngcgct tatncntgcg 540 
ntttntntng gnaccaantc 600 
tctcnccttc ntcgngtttn 660 

670 



SEQ ID NO: 139 
gatgcatgct cgagcggccg 
atttttttct aagcaacctc 
aagcactggc catcctactg 
agatgtactt tgttttctca 
atgaccgctg cttgccatct 
ctnagcgcag ctgncctggg 
cttcatcagt ggncttgtnc 
atngtacccc tgnanttgcc 
ntggccnccn tgnttcgcnt 
ttnnctctat gttntcaacn 
cggannnnta nnctttccnn 



ccagtgtgat ggatatctgc 
tccttcctgg agatttggta 
gggagaagtc agaccatatc 
ttaggctgca cagagtactt 
gctatccttt acactacgga 
ctcctgggtg ngtggttcng 
ttctgngccc ccgcncatcn 
ctggccttgt anccancaca 
ngtnnccgng ntanccngnc 
tcncngncta ttcgctcang 
accntctttc cntnc 



agaattcgcc cttccgatgt 60 
taccacagca gcagtgccca 120 
atttacaagc tgtcttttgc 180 
cctcttggca gccatggctt 240 
gccatcatga gtagcctgct 300 
ngccattcag cgcccacagn 360 
aaccantttc ttctgngana 420 
tangctcgta tgngcttctn 480 
tnnacgtcct ttcnnacact 540 
atanccactc ttncanncnt 600 



SEQ ID NO: 140 ^*-„4-„ a frrnAtatcta cagaattcgc 60 

atgaccctct agatgcatgc tcgagcggcc gccagtgtga tjg«t.tctg J 12Q 
ccttcctatg tatttatttc taagcaacct ctccttcctg ^"tggg 180 
cagcagtgcc caaagcactg ^ccatccta ctggggaga £c.,.cc.t ^ 
agctgtcttt tgcagatgta etttgttttc tcattagg * a cgga gccatc 300 

gcagccatgg cttatgaccg ctgtcttgcc aaat qtgtgg tttcggnggc 360 

atgagtagcc tgctctcagc gcaagctggc ctgggctcct ^tgtgg ^ 
cattgcagng cccacagcnc tn.tc.jtgg ^tccttt ctgtggg 48Q 
tcaacccacn tttctttttg "W^attgg "accccntg g « ttnt tt 5 40 
ngcacncaaa ccancaccag ggtcngnnna "anctttgn V™ acctttttgn 600 

ggcctnggtg ngggnntaat tcnctttggn "tnaatgcc cttcc.. ccnecttttt 660 
cnttcntatg ggngnnccct tnnattcnag ««acancc CC ccccccg 709 
gtcaagtgng nccggtnann naaaagccnt nttccnnntg ccccccccg 

SEQ ID NO: 141 ,.„«.„. trrnatatcta cagaattcgc 60 

ntgggccctg agatgcangc tcgagcggcc «cca,tgtga tggatatctg J ^ 12Q 
ccttcccatg tatttttttc taagcaacct ctccttcctg ^ttgg catt tacaag 180 
agcagtgccc aaagcactgg cc.tcct.ct MM»^ J gtact tcctcctggc 240 
ctgtcttttg cagatgtact ttgttttctc attaggctgc y y gagcca tcat 300 
agccatggct tatgatcgct gtcttgccat ^gctatcct "acactacg g g 
gagtagcctg ctctcagcgc agctggccct ^ctcctgg ^ctgtggtt ^ 

sees ess; usee ssus c - i?~ - 
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ggcngttngn gctgttggcc ctttcgctca 
nngcctccnt ctactctctg cnttctanct 
cctcgatctc ctttcangnc tccgctncac 
nntcnnntnc g 



cncngtctgc gcccattctc nctgtnncaa 540 
antnncncct ctttnctncc tnnantctnt 600 
tgctcnctna acgtccnttt cttccctnnt 660 



SEQ ID NO: 142 

gggcnncttt gggtatgcct tgncccttag 
gatatctgca gaattcgccc ttccaatgta 
tgcctgtttc acttctgcct ccatccccaa 
gatcatctcg tattctgggt gtcttgcaca 
tgacaactgc ctgctggctg tgatggcata 
ccattacagc acatctatga gtccccagct 
gctaaccaac tggnctgccc tgatgcacac 
ccnntangcc nctncnntcn ttccnttntn 
cttccccnnt cttcctnntg tactcnctan 
ttctctntcn ctttcgnncn tnnttnctnc 
ttcnnaatcg ctnctatcnc cgcctatagt 
nccntcctcn ccatcantnc taacctnctn 
gnccnatttc nttttcccn 



atgcatgctc gagcggccgc cagtgtgatg 60 
cttatttcta gccaacctgt cattaactga 120 
aatgctggcc aacattcata cccagagtca 180 
gctatatttc ctccttatgt tnggnggcct 240 
tgaccgctat gtggccatct gccaaccact 300 
ctgtgcacta atgctgtgng tgngctgngt 360 
actgttnctn atcccngcgc tttcttggtc 420 
tntctctacc tctcccntcg ngctctnccc 480 
nnctgttnnn ccccntcntt ctcttcttcc 540 
tcttgtccct acctgtcccn ntcatacctt 600 
ncaattcnnc tncctnctnn attncctacn 660 
cntnntctct ntctctgtcc tcancntctc 720 

739 



SEQ ID NO: 143 

gatgcatgct cgagcggccg ccagtgtgat 
aattgggttc agcatggggg tcaccacagt 
agtagaacta ttagctgatg agcataagta 
caccacagac aggtgggagc cacaagtaga 
gaccttgagg atggaggaga caattcttgc 
gacaagaatg agccctccca tgataaatat 
ggccagcttc agcagagcag acatatcaca 
aaaacacaac ctggccatga gtaaagtgtg 
ggacagcgcc accaggncga gacagagcat 
gangcagatg nccacantag tgntnatagn 
ctttnaataa c 



ggatatctgc agaattcgcc cttgatagat 60 
gtacatcata gccatgacag tgtcctttag 120 
gagaccaata acggttccat agaacagtga 180 
gaaggccttg cagataccct tagaagaagg 240 
ataggaccca aggatgagta ggaatgggat 300 
cacccattca ttaactcgag tgtcagagaa 360 
gaaaaagtgg gggatcacat tgtctgcaca 420 
taacatggca tggaaggtgg tcagcaccca 480 
ggggctcatg atggcggngt agtgcngggn 540 
ccatggtcac angggaggna gctttcaggn 600 



SEQ ID NO: 144 

gcgtgctcga gcggccgcca gtgtgatgga 
gagtacatga aggggttaag tgaaggagtg 
cccctccctt gggcatacgg atttttgggc 
aagagggtga ccacagtgag atgggaggag 
gnagagttaa tcctcagcac tgnctgggca 
agaggcacaa ccacngaaga tgacactgta 
atctccacag gngagnccaa tcagntgatg 
tnctnttgnc ngncacgang gtcctnnnng 
taccntccct cntccctcnt nttttctttc 
gtncncntnt atcttcccta ntncntcttt 
tncntccctc tcnancntat cncttgnncc 



tatctgcaga attcgccctt gttgcgcaaa 60 
cccactgcat agaagagacc aaagaacttg 120 
tggaggtaga cagcaatgac tgagctgtag 180 
caggtcccaa aggcctttct ccatgctgtg 240 
gtggctccat aagaggcang gatgaggctg 300 
cacagccaac tgtattttat tgnaggnggn 360 
gntcccnccc atttcanaag tcactntatn 420 
agcngttctt gtccnntctt nactatcgnt 480 
cncctncctc ttcntttnnc cntntcccnt 540 
tnntnctntt tngnnncctt cctctntctt 600 
cncccnntnc c 641 



SEQ ID NO: 145 

ggttgccnnc gnttaggcat tgggccctct 
tggatatctg cagaattcgc ccttccgatg 
gagatttggt ataccacagc agcagtgccc 
cagaccatat catttacaag ctgtcttttg 
acagagtact tcctcctggc agccatggct 
tttacactac ggagccatca tgagtagccn 
tggntgngct ggtttctcgc cctattnttn 



agatgcatgc tcgagcggcc gccagtgtga 60 
tatttgtttc taagcaacct ctccttcctg 120 
aaagcactgg ccatcctact ggggagaagt 180 
cagatgtact ttgttttctc attaggctgc 240 
tatgaccgct gtcttgccat cctgctatcc 300 
tgctctcagc tgcagctggn cctgggctcc 360 
ncnnnacnnn ccntantcng ncnctnctct 420 
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ctttcttnnt tccctttncc 
nttgtntntc gnttctcccn 
ccctttgntc tctacnctct 
gatnttcncc tncttactgc 
ctnacnantc tntcactcnt 
nntncctcac cnnntacatg 
ttncatactc nctaacttct 



tcactcatnc ctcnctctct 
ntcntnnctt ctctnttgct 
tncgnantca ctnnnatntc 
tactctctnc tatactnnnc 
tcccanncnn tcnctgtcnt 
gttccttnnn ntccatctcg 
ctccatcatc ntcacctntc 



tttnntgtcc tcttnataac 480 
tcnctctcct cntttcgnat 540 
tnntcacgng cntcctcnnn 600 
ttntntncat anttcgtctg 660 
ctgactctcn cctcntntnt 720 
tcnntctctc cnntatacgn 780 
tttctttntc cctngnc 837 



SEQ ID NO: 146 
gatgatgctc gagcgncgca 
tatttctagg caccactgac 
caatctgccg accactccgc 
tgctggcctc ctggctagct 
gcctgccttt ctgtggcccc 
tcaggctttc ttgtggggac 
tggtggtact gggcccacng 
ttctnagngc ccncnanngc 
tcggaantta aagggggtgg 
tnccanngnn tnntngctca 
gtanactgcc nttaattnga 



gtgtgatgga tatctgcaga 
ttcttcctct tggccgtcat 
tatgagaccc tgatgaatgg 
ggattcctct gggtcctttg 
aatggtattg accacttctt 
acccacctgc tgaaactggn 
gctctgacct cagntttcta 
ttgccngagc gaagcanaag 
cgcnncancn nctgggnngc 
ntccctntnc tcntcncaat 
ccnctttccc nacncncac 



attcgccctt ccaatgtatt 60 
gtctctggat cgttacctgg 120 
ccatgtctgt tcccaactag 180 
ccccactgtc ctcatggcca 240 
tcgtgacagt tggcccttgc 300 
ggctttcatg ctctctacgt 360 
ngcccgcatt cttgccactg 420 
atnnttttca cattgcgcac 480 
ttcattctnt ctttttactt 540 
cntnnnggcn ctcntgntnn 600 

639 



SEQ ID NO: 147 
catagatgca tgctcgagcg 
tgtaagttct ttctaggcac 
tacctggcaa tctgccgacc 
caactagtgc tggcctcctg 
atggccagcc tgcctttctg 
cccttgctca ggctttcttg 
tctacgttgg tgttactggg 
gtcactgtct caggncccct 
cnatcttaca ggggtggcat 
tcccagctat antccaaagt 
ccttcttcgt aacctncc 



gccgcagtgt gatggatatc 
cactgacttc ttcctcttgg 
actccgctat gagaccctga 
gctagctgga ttcctctggg 
tggccccaat ggtattgacc 
tggggacacc cacctgctga 
ctcactggct ctgacctcag 
nnagntgctg ngcgaaggaa 
catctnangg ggngnntgca 
nctnaaaaca ngancctcgg 



tgcagaattc gcccttccga 60 
ccgtcatgtc tctggatcgt 120 
tgaatggcca tgtctgttcc 180 
tcctttgccc cactgtcctc 240 
acttctttcg tgacagttgg 300 
aactggnggc tttcatgctc 360 
nttcntangc ctgcattctt 420 
agcgcntttc acttgcgcct 480 
tccttnncta nntncncagg 540 
nangannnct nntattctac 600 

618 



SEQ ID NO: 148 
cntagatgca ngctcgagcg 
atgtattttt tctcactaac 
cccagctgct tttcaatcta 
tccagctctt catgttcctg 
ctatgaccgc ttcattgcaa 
gctgtgctgg aagttggtgt 
tgcctcctgn gactatgaag 
ttnntgngat gccngcntcn 
gcattnngtc nccttnatnn 
tcnttcntng gcttancntt 
cttctanctc tncatctttc 



ggcgccagcg tgnngnanat 
ttgtctttcc tagatctctg 
ggcagcccag gcaagactat 
ggcctgggtg gcaagagtgt 
tctgcaagcc ccttcactat 
ctgtggcccg ggggtgttgg 
cttgtcacga tgcggaagat 
tataaaaanc annctgggcg 
catcnnattt gcctngnngt 
ctncaccngn ncttncntan 
ttnccntcca tec 



ctgeagaatt cgcccttcca 60 
cttcaccacc agttctatcc 120 
cagccacacg ggctgtgcca 180 
attctcttgg cagccgtggc 240 
tctgtcatta tgcaccctca 300 
actccncagt tntctaggta 360 
gtaagnttgc ancnttneen 420 
ggtcacagtg cttnngnata 480 
ccctcgttcc cantntnean 540 
ctactccntn ttnnttcntc 600 

633 



SEQ ID NO: 149 

gatgeatget cgagcggccg ccagtgtgat 
agactataca tgaatgggtt tagcateggg 
ttctgetget cctcaggatg gegggactta 
aagaagagtc ccactacgca gaggtgggag 
tccccagact ggatcctcag gatggccgcc 



ggatatctgc agaattcgee cttgttccta 60 
ttgaaagaac tgtaaaatag aaaaaggacc 120 
ggggccatgt acatgacgat ggcgctgcca 180 
gagcaggtgg agaaggcett tctgcggccc 240 
aggatgtgtg agtaggagac cagcaccagg 300 
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cagagtggtc ccaccaggat 
gtatcagcac aggccagcct 
acgaggccca canaaagggc 
agccacattg tcccaggaag 
ngnnttattc ggagagnntg 
ccancnggng ccttcantna 



gaacatgcag gctgcaaaga 
gaggacagac aggatttcac 
agtcttagga tgaggntcac 
ngntgnccag agtgatgcag 
nnagacnggt cancgttccc 
tgtc 



tgaccacctg gttgagccag 360 
aagaagaagt ggttgatttc 420 
atggaccata gccaggaggg 480 
acttttcagg tcntgatgat 540 
gntcgtagga caattancac 600 



SEQ ID NO: 150 
gatgcatgct cgagcggccg 
atttatttct ctctgacctc 
agatgctggt caacctctgg 
agctcttcat cttcctgtcc 
ttgaccgata cgtggctgtc 
tgtgctggca gctggcatct 
catcatccac cctccacttg 
aggtcccatc tctgattcga 
ctgtgtccag tgtcatcttt 
gccactgccc aggcnggggc 
nggacctngn n 



ccagtgtgat ggatatctgc 
tccttcttgg acctctgctt 
ggcccaaaga agaccatcag 
ctggggacca ctgagtgcat 
tgccagcccc tccactatgc 
gtggcctggg ttatgagtct 
cccttctgtc cccaccagca 
ctctcctgng gagatacctc 
ggtggntgtg cctctcagcc 
tgaggattaa ctttgccnna 



agaattcgcc cttccaatgt 60 
taccacaagt tgtgtccccc 120 
cttcctggga tgctctgtcc 180 
cctcctgaca gtgatggcct 240 
caccatcatc cacccccgcc 300 
ggttcaatcg atagtccaga 360 
gatagatgac tttttatgtg 420 
ctacaatgaa atccagttgn 480 
tcatccttgc ctcttatgga 540 
gccatggaag aaaggtcttt 600 



lilillllsiilil 
llillllliiiiil 

2SSS SSSS •» S°, 

gaaccgntan aacgntccc 



SIS SSSS 5SS5S SS2= ssss ssg s. 

S= =5 ii 111 lil i 

f'f": SStcSc SS££5 "tcnn?„n„„ cacncttnnc tct,c.«tc 660 

SS= ZZSS "tcttc'ct. cncttct one tgjj-j, «=t«cnc 
„? t ct,ct=t c ? « S tc=c c« S «ct» c,=.. c 8,0 

~H ™ ™ SSSS =22 =" = 
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<220> 

<223> Synthetic construct 

<221> VARIANT 

<222> (1) . . . (329) 

<223> Xaa = Any Amino Acid 



<400> 1354 



Glu 


Glu 


He 


Leu 


Xaa 


He 


He 


Ser 


Gin His Val His 


Thr Gly Cys Val 


1 








5 








10 




15 




Gin 


Asn 


Xaa 


Glu 


Leu 


Gin 


Pro 


He 


Leu Phe Gly Leu 


Phe 


Leu Ser 


Met 








20 










25 




30 




Cys 


Leu 


Val 


Met 


Val 


Leu 


Gly Asn 


Leu Leu He He 


Leu 


Ala Val 


Ser 






35 










40 




45 






Ser 


Asp 


Ser 


His 


Leu 


His 


Thr 


Pro 


Thr Tyr Phe Phe 


Leu 


Ser Asn 


Leu 




50 










55 




60 








Ser 


Leu 


Ala 


Asp 


He 


Gly 


Phe 


Pro 


Ser Thr Thr Val 


Pro 


Lys Met 


He 


65 










70 






75 






80 


Val 


Asp 


He 


Gin 


Ser 


His 


Ser 


Arg 


Val He Ser Tyr 


Ala 


Gly Cys 


Leu 










85 








90 




95 




Thr 


Gin 


He 


Ser 


Leu 


Phe 


Ala 


Val 


Phe Gly Cys Met Glu Asp Met 


Leu 








100 










105 




110 




Leu 


Ser 


Val 


Met 


Ala 


Tyr 


Asp Arg 


Phe Val Ala He 


Cys 


His Pro 


Leu 






115 










120 




125 






Asp 


Tyr 


Pro 


Val 


He 


Met 


Asn 


Pro 


Cys Phe Cys Gly 


Phe 


Leu Val 


Leu 




130 










135 




140 








Leu 


Ser 


Phe 


Phe 


Leu 


Ser 


Leu 


Leu 


Asp Phe Gin Leu 


His 


Asn Trp 


He 


145 










150 






155 






160 


Ala 


Leu 


Gin 


He 


Thr 


Cys 


Phe 


Lys 


Asp Val Glu He 


Pro 


Ser Phe 


Phe 










165 








170 




175 




Cys 


Asp 


Pro 


Ser 


Gin 


Leu 


Pro 


His 


Leu Ala Cys Cys 


Asp 


Thr Phe 


Thr 








180 










185 




190 




Asn 


Asn 


He 


Val 


Met 


Tyr 


Phe 


Leu 


Ala Ala He Leu Gly 


Phe Leu 


Pro 






195 










200 




205 






He 


Ser 


Gly 


He 


Phe 


Tyr 


Ser 


Tyr 


Tyr Lys He Val 


Ser 


Ser He 


Leu 




210 










215 




220 








Lys 


Val 


Ser 


Ser 


Ser 


Gly 


Gly Lys 


Tyr Lys Ala Phe 


Ser 


Thr Cys Gly 


225 










230 






235 






240 


Ser 


His 


Leu 


Ser 


Val 


Val 


Cys 


Leu 


Phe Tyr Gly Thr Ala 


Leu Gly Gly 










245 








250 




255 




Tyr 


Leu 


Ser 


Ser 


Asp 


Met 


Ser 


Ser 


Tyr Pro Arg Lys 


Gly Ala Val Ala 








260 










265 




270 




Ser 


Val 


Met 


Tyr 


Thr 


Val 


Val 


Ala 


Pro Met Leu Asn 


Pro 


Phe He 


Tyr 






275 










280 




285 






Ser 


Leu 


Arg 


Lys 


Arg 


Asp 


He 


Lys 


Ser Ala Leu Gin 


Gin 


Leu His 


Gly 




290 










295 




300 








Arg 


He 


Val 


Xaa 


Ser 


His 


Asp 


Leu 


He He Gly Ser 


He 


Leu Xaa 


Pro 


305 










310 






315 






320 


Trp Val 


Gly 


Lys Gly 


Ser 


Lys Val 


Lys 
















325 

















<210> 1355 
<211> 321 
<212> PRT 

<213> Unknown (H38g272 protein) 
<220> 

<223> Synthetic construct 
<400> 1355 



662 
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Met Glu Ser Pro Asn His Thr Asp Val Asp Pro Ser Val Phe Phe Leu 

15 10 15 

Leu Gly lie Pro Gly Leu Glu Gin Phe His Leu Trp Leu Ser Leu Pro 

20 25 30 

Val Cys Gly Leu Gly Thr Ala Thr He Val Gly Asn He Thr He Leu 

35 40 45 

Val Val Val Ala Thr Glu Pro Val Leu His Lys Pro Val Tyr Leu Phe 

50 55 60 

Leu Cys Met Leu Ser Thr He Asp Leu Ala Ala Ser Val Ser Thr Val 
65 70 75 80 

Pro Lys Leu Leu Ala He Phe Trp Cys Gly Ala Gly His He Ser Ala 

85 90 95 

Ser Ala Cys Leu Ala Gin Met Phe Phe He His Ala Phe Cys Met Met 

100 105 HO 

Glu Ser Thr Val Leu Leu Ala Met Ala Phe Asp Arg Tyr Val Ala He 

115 120 125 

Cys His Pro Leu Arg Tyr Ala Thr He Leu Thr Asp Thr He He Ala 

130 135 140 

His He Gly Val Ala Ala Val Val Arg Gly Ser Leu Leu Met Leu Pro 
145 150 155 160 

Cys Pro Phe Leu He Gly Arg Leu Asn Phe Cys Gin Ser His Val He 

165 170 175 

Leu His Thr Tyr Cys Glu His Met Ala Val Val Lys Leu Ala Cys Gly 

180 ^ 185 190 

Asp Thr Arg Pro Asn Arg Val Tyr Gly Leu Thr Ala Ala Leu Leu Val 

195 200 205 

He Gly Val Asp Leu Phe Cys He Gly Leu Ser Tyr Ala Leu Ser Ala 

210 215 220 

Gin Ala Val Leu Arg Leu Ser Ser His Glu Ala Arg Ser Lys Ala Leu 
225 230 235 240 

Gly Thr Cys Gly Ser His Val Cys Val He Leu He Ser Tyr Thr Pro 

245 250 255 

Ala Leu Phe Ser Phe Phe Thr His Arg Phe Gly His His Val Pro Val 

260 265 270 

His He His He Leu Leu Ala Asn Val Tyr Leu Leu Leu Pro Pro Ala 

275 280 285 

Leu Asn Pro Val Val Tyr Gly Val Lys Thr Lys Gin He Arg Lys Arg 

290 295 300 

Val Val Arg Val Phe Gin Ser Gly Gin Gly Met Gly He Lys Ala Ser 
305 310 315 320 

Glu 



<210> 1356 
<211> 327 
<212> PRT 

<213> Unknown (H38g273 protein) 
<220> 

<223> Synthetic construct 

<221> VARIANT 

<222> (1) . . . (327) 

<223> Xaa = Any Amino Acid 



<400> 1356 

Met Thr Trp Ser Gly Gly Thr He 

1 5 
Leu Leu Gly Phe Pro Ala Pro Ala 
20 

Leu Leu Leu Leu Ala Tyr Val Leu 



Val Gly Glu Xaa Gly Glu Phe Val 

10 15 
Pro Leu Gin Val Leu Leu Phe Ala 
25 30 
Val Leu Thr Glu Asn Thr Leu He 
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<400> 423 

atggaatctc ctaatcacac tgatgttgac ccttctgtct tcttcctcct gggcatccca 
ggtctggaac aatttcattt gtggctctca ctccctgtgt gtggcttagg cacagccaca 
attgtgggca atataactat tctggttgtt gttgccactg aaccagtctt gcacaagcct 
gtgtaccttt ttctgtgcat gctctcaacc atcgacttgg ctgcctctgt ctccacagtt 
cccaagctac tggctatctt ctggtgtgga gccggacata tatctgcctc tgcctgcctg 
gcacagatgt tcttcattca tgccttctgc atgatggagt ccactgtgct actggccatg 
gcctttgatc gctacgtggc catctgccac ccactccgct atgccacaat cctcactgac 
accatcattg cccacatagg ggtggcagct gtagtgcgag gctccctgct catgctccca 
tgtcccttcc ttattgggcg tttgaacttc tgccaaagcc atgtgatcct acacacgtac 
tgtgagcaca tggctgtggt gaagctggcc tgtggagaca ccaggcctaa ccgtgtgtat 
gggctgacag ctgcactgtt ggtcattggg gttgacttgt tttgcattgg tctctcctat 
gccctaagtg cacaagctgt ccttcgcctc tcatcccatg aagctcggtc caaggcccta 
gggacctgtg gttcccatgt ctgtgtcatc ctcatctctt atacaccagc cctcttctcc 
ttttttacac accgctttgg ccatcacgtt ccagtccata ttcacattct tttggccaat 
gtttatctgc ttttgccacc tgctcttaat cctgtggtat atggagttaa gaccaaacag 
atccgtaaaa gagttgtcag ggtgtttcaa agtgggcagg gaatgggcat caaggcatct 
gag 

<210> 424 
<211> 982 
<212> DNA 

<213> Unknown (H38g273 nucleotide) 
<220> 

<223> Synthetic construct 
<400> 424 

atgacatgga gtggcggaac catagtggga gagtgagtga gtttgtgttg ctggggcttc 
cctgctcctg cgccactaca ggtactattg tttgcccttt tgctgctggc ctatgtgttg 
gtgctgactg agaacacact catcattatg gcaattagga accattccac cctccacaaa 
cccatgtact tttttctagc taatatgtcc tttctggaga tctggtatgt cactgtcact 
attcccaaga tgcttgctgg ctttgttgga tccaaacagg atcatggaca gctaatctcc 
tttgagggat gcatgacaca gctctacttt ttccttggct tgggctgcac tgagtgtgtc 
cttctcgctg ttatggccta tgatcgctat atggccatct gctatcctct ccactaccca 
gtcattgtca gtggccggct gtgtgtgcag atggctgctg gctcttgggc tggaggtttt 
ggcatctcca tggtcaaagt ttttcttatt tctggcctct cttactgtgg ccccaacatc 
atcaaccact ttttctgtga tgtctctcca ttgctcaacc tctcatgcac tgatatgtcc 
acagcagagc ttacagattt catcctggcc atttttattc ttctagggcc actctctgtc 
actggggcct cctatgtggc cattactggt gctgtgatgc acataccttc ggctgctgga 
cgctataagg ccttttccac ctgtgcctct catctcactg ttgtgataat cttctatgca 
gccagtatct tcatctatgc tcggccaaag gcactctcag cttttgacac caacaagttg 
gtctctgtac tgtatgctgt cattgtacca ttgctcaatc ccatcattta ctgcctgcgc 
aatcaagagg tcaagagagc cctatgctgt actctgcacc ctgtaccagc accaggatcc 
tgaccccaag aaagctagca ga 

<210> 425 
<211> 936 
<212> DNA 

<213> Unknown (H3 8g274 nucleotide) 
<220> 

<223> Synthetic construct 
<400> 425 

atggaagcag gaaaccaaac aggattttta gagtttatcc ttctcggact ctctgaggat 
ccagaactac agccgttcat atttgggctg ttcctgtcca tgtacctggt gacggtgctg 
ggaaacctgc tcatcatcct ggccatcagc tctgactccc acctccacac ccccatgtac 
ttcttcctct ccaacctgtc ctgggttgac atctgtttca gcacttgcat cgtccccaag 
atgctggtga acatccagac cgagaacaaa gccatctcct acatggactg cctcacacag 
gtctatttct ccatgttttt tcctattctg gacacgctac tcctgaccgt gatggcctat 
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300 
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60 
120 
180 
240 
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360 
420 
480 
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720 
780 
840 
900 
960 
982 



60 
120 
180 
240 
300 
360 



173 



